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CHAPTER 4 


Stochastic differential equations 


Some pertinent topics in the present chapter consist of a discussion on mar¬ 
tingale theory, and a few relevant results on stochastic differential equations in 
spaces of finite dimension. In particular unique weak solutions to stochastic dif¬ 
ferential equations give rise to strong Markov processes whose one-dimensional 
distributions are governed by the corresponding second order parabolic type 
differential equation. Essentially speaking this chapter is part of Chapter 1 in 
[146]. (The author is thankful to WSPC for the permission to include this text 
also in the present book.) In this chapter we discuss weak and strong solutions 
to stochastic differential equations. We also discuss a version of the Girsanov 
transformation. 


1. Solutions to stochastic differential equations 


Basically, the material in this section is taken from Ikeda and Watanabe [61]. 
In Subsection 1.1 we begin with a discussion on strong solutions to stochastic 
differential equations, after that, in Subsection 1.2 we present a martingale 
characterization of Brownian motion. We also pay some attention to (local) 
exponential martingales: see Subsection 1.3. In Subsection 1.4 the notion of 
weak solutions is explained. However, first we give a definition of Brownian 
motion which starts at a random position. 

4.1. Definition. Let (f2, £F, P) be a probability space with filtration (Tt) t>0 . 
A d-dimensional Brownian motion is a almost everywhere continuous adapted 
process {B(t) = ..., Ba(t )) : t ^ 0} such that for 0 < t\ < t 2 < ■ ■ ■ < 


t n < oo and for C any Borel subset of (PA)" the following equality holds: 
P [{B (G) - B( 0),..., B (t n ) - B( 0)) e C] 



J P0,d (tn t n — i, X n —i , X n ) • • • Po : d (^2 t\ , X\ , X2) Po : d (t\ , 0, 3q) 

c 

dx 1 ... dx n . 


(4.1) 


This process is called a d-dimensional Brownian motion with initial distribution 


if for 0 < ti < t 2 < • • • < G < 00 and every Borel subset of (M d ) n+1 the 
following equality holds: 


Pp(0) ) B(f 1 ),...,5(f n ))eC] 



J P0,d {tn tn— 1 5 %n— 1 5 •^n) ’ ’ ' P0,d (^2 1 %1 5 ^ 2 ) P0,d (^1 5 ^0 5 ^l) 

C 
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dp (xo) dx i... dx n . (4.2) 

For the definition of Po,d(t,x,y) see formula (4.26). By definition a filtration 
(<F t ) t *> is an increasing family of a-fields, i.e. 0 < t\ < t 2 < go implies “J tl a 3 r t2 . 
The process of Brownian motion {B(t) : t ^ 0} is said to be adapted to the 
filtration (3y ) t>0 if for every t ^ 0 the variable B(t) is Sy-measurable. It is 
assumed that the P-negligible sets belong to To- 


1.1. Strong solutions to stochastic differential equations. In this sec¬ 
tion we discuss strong or pathwise solutions to stochastic differential equations. 
We also show that if the stochastic differential equation in (4.108) possesses 
unique pathwise solutions, then it has unique weak solutions. We begin with a 
formal definition. 

4.2. Definition. The equation in (4.108) is said to have unique pathwise so¬ 
lutions, if for any Brownian motion {(B(t) : t T 0 ), (Q, T. P)} and any pair of 
M d -valued adapted processes {X(t) : t ^ 0} and {X'(t) : t ^ 0} for which 

a (s, X(s)) dB(s) + f b(s,X(s))ds and (4.3) 

Jo 

a(s,X'(s))dB(s) + f b(s,X'(s))ds (4.4) 

Jo 

it follows that X(t) = X'(t) P-almost surely for all t > 0 . If for any given 
Brownian motion (B(t)) t>0 the process (X(t)) t>0 is such that for P-almost all 
uj e fl the equality 

X(t,cu) = x+ f a (s, X(s,u/)) dB(s,uj) + f b(s,X(s,u>)) ds 

Jo Jo 

is true, then t >—* X(t) is called a strong solution. 


X(t) =x+[ 

Jo 

X'{t) =x + ( 

Jo 


Strong solutions are also called pathwise solutions. In order to facilitate the 
proof of Theorem 4.4 we insert the following lemma. 

4.3. Lemma. Let 7 be a positive real number. Then the following inequality 
holds: 

00 Tlj^h 1 / "j 2 1 \ 

V 7 ^! ^ 2 + Vt^ 4) exp (^g (VT + Vt^4) “2 J- ( 4 - 5 ) 


Since y'q + y'q + 4 < 2^7 + 2 , the inequality in (4.5) implies: 

00 77^/2 / " j ^ \ 

Yi WTT ^ V7 + 2 exp f - (7 + 1) j < 00. 


n —0 


(4.6) 


We will use the finiteness of the sum rather than the precise estimate. 
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Proof of Lemma 4.3. Let S > 0 be a positive number. Then we have by 
the Cauchy-Schwarz inequality 


00 yV 2' 


2 


d Vn! 


2 


7 


n/2 


(S + j) 


n/2 


<n—0 


\n=0 

oo 


d (<* + 7) n/2 Vn! 


< 


2 


7 


i (5 + 7 ) n ^ 

= 11 ' ' ' m — 


2 


(S + y) n 5 + 7 


, 5+7 


n=0 


n=0 


n! 


(4.7) 


The choice 5 = | ^—7 + ^ 7(7 + 4) j yields the equalities 

<5 + 7 = 1 (V 7 + V 7 + 4) - 1 , and ^ = ^(V7 + Vt + 4) > 
and so the result in (4.5) follows and completes the proof of Lemma 4.3. □ 


A version of the following result can be found in many books on stochastic 
differential equations: see e.g. [61, 107, 113]. 
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4.4. Theorem. Let cr^ (s, x) and bj ( s, x), 1 < j, k < d be continuous functions 
defined on [0, oo) x R d such that for all t > 0 there exists a constant K(t) with 
the property that 

d d 

2 Wj,k (s,x) - a j}k (s,y )| 2 + J] |bj (s,x) - bj (s,y) | 2 < K(t) \x - y\ 2 (4.8) 

j,k=l j =1 

for all 0 < s < t, and all x, y e M. d . Fix x e M. d , and let (Cl, T. P) be a probability 
space with a filtration (B t ) t>0 . Moreover, let { B(t ) : t ^ 0} be a Brownian mo¬ 
tion on the filtered probability space (fi,T t ,P). Then there exists an Mfi-valued 
process {X(t) : t ^ 0} such that, for all 0 < T < co, sup 0<t<r E [|X(f)| 2 ] < co, 
and such that 

= x+ f a (s, X(s)) dB(s) + 

Jo 

This process is pathwise unique in the sense of Definition f.2. 



f 


b(s,X(s)) ds, 0. (4.9) 


The techniques in the proof below are very similar to a method to prove the 
following version of Gronwall’s inequality: see e.g. [54], Let f, g, h: [0,T] R 
be continuous functions such that /(f) < g(t) + h(s)f(s) ds, 0 ^ t ^ T. If 
h ^ 0, then by induction with respect to k it follows that 


k t 

m < ad) + s f 

3 =1 




k 


h(s)f(s) ds, 


and hence 


/(f) < g(t) + J g(s) exp (J^ h(p) dp 


ds. 


Let C ([0,T],L 2 (Q, T, P; R d )) be the space of all continuous L 2 (fi, T, P; 
valued functions supplied with the norm: 


X\\= sup (E[|X(f)| 2 ]) V2 , 




Define the operator T : C ([0, T], L 2 (Cl, T, P; M d )) — C ([0, T], L 2 (Cl, T, P; R d )) 
by the formula 

TX(t ) = x + f a(s, X(s)) dB(s) + 

Jo 

Then the argumentation in the proof below shows that T is a mapping from 
C ([0, T], L 2 (f2,T,P;R d )) toC([0,T],L 2 (f2,T,P;R d )) indeed, and that T has 
a unique fixed point X which is a pathwise solution to the equation in (4.9). 



Proof. Existence. Fix 0 < T < co. Put X 0 (s) = x, 0 < s < f, and, for 
n ^ 1, 0 < f ^ T, 


X n+ \(t) = x + 




a (s,X n (s)) dB(s). 


(4.10) 
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By (4.10) we see, for n ^ 1 and 0 < t ^ T, 

x n+ 1 (t) -X n (t) = \ (b (s, X n (s))-b (s, X^is))) ds 

Jo 

f (cr(s,X n (s))-cr(s,X n _i(s))) dB(s). (4.11) 

Jo 


+ 


By assumption there exists functions s >—> Kj(s) and s >—> ($), 0 < s < T, 

such that for 

|bj ( s,y) — bj (s,rc)| < Kj(s) \y — x\ , 0 < s < T, x, y e and (4.12) 
Wij ( s,y ) - cT i:j (s,x)| < Kij(s) \y-x\, 0 < s < T, x,yeR d , (4.13) 

and such that ^ ( Kj(s) 2 + Kij(s) 2 ) ds < go for 0 < 1 < i, j < d. Let the 
function K(s) ^ 0 be such that K(s ) 2 = 2j=i J'O(s) 2 + Th=i max i^Kd Kij( s ) 2 - 
Then K(s ) 2 ds < go. Moreover, for n ^ 1 and 0 ^ t ^ T we infer, by using 
(4.11). (4.12) and (4.13), by the definition of K(s), and by standard properties 
of stochastic integrals relative to Brownian motion, the following inequality: 

E[|X re+ i(t) -X n (t)\ 2 ] < 2 J^iL( S ) 2 E[|X n ( S ) -X n _!( S )| 2 ] ds. (4.14) 

In order to obtain (4.14) we also used an inequality of the form (|a| + \b\) 2 < 
2 (|a| 2 + |6| 2 ), a, b e M d . The proofs of (4.15) and (4.18) require equalities of 
the form 


| fy[iL(sj ) 2 dsi.. .dsj 

JS<S^<^^^<Sn<t J A — 1 


(t K (p) 2 dpy 


j- 


, j e N, j ^ 1. 


Js<Sl<--<Sj<t J 

By employing induction the inequality in (4.14) yields, for 1 ^ j ^ n and for 
0 < t < T, the inequality: 

i2n 


E[ix„ + i(t)-x n («)n 

(s: K(p?dp )’~ 1 


^ y r 

Jo 


K(s ) 2 E [|X n _ i+1 (s) - X n -j(s )| 2 ] ds. (4.15) 


Since X 0 (s) = x the equality in (4.10) for n = 1 yields 

Xi(s)-X 0 (s)= b (p, x) dp + a (p, x) dB(p), 

Jo Jo 

and hence, for 0 ^ s < T, 


EflXi^-Xo^)! 2 ] <2 

Let A(s, x) ^ 0 be such that 


b (p, x) dp 


*,.7 = 1 


Wij (p> x )f dp ] . (4.16) 


.A(s, x) 2 = sup 

0 <T^S 


b (p, x ) dp 

Jo 


+ 


d rs 

s J 

hJ= 1 


<?ij (p,x) | 2 dp. 


(4.17) 
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Then (4.17) together with (4.15) with j = n yields 

(s^(p)^r 1 


E[|X„ +1 (()-X„(()| 2 ] « 2” f 

Jo 


0 (n-1)! 


K(s) 2 E[|X 1 (s) -X 0 (s)| 2 ] ds 


< 2 


71+1 


1 


n— 1 


< 2 n+1 A(t, x) 


(iK(p?dp) 

0 0 - 1 )! 

(S‘ K(pfdp) 


J 


0 0 - 1 )! 


i^(s) 2 ^4(s, a:) 2 ds 

71—1 

— K{sfds 


= 2 


77 + 1 


+M) 2 f fn^o) 

Jo<si<---<s n <t J j—i 


dsi ... ds r 


= 2 n+1 A(t, x)‘ 


(So K(sf 


ds 


n\ 


(4.18) 


From Lemma (4.3) and inequality (4.6) with 7 = 2 ^ K(s) 2 ds we infer: 

V (E[\X n+1 (t) - X n (t)\ 2 ]) 1/2 < 2A(t,x)J f K(syds + le& K M 2da+ *. 

n =0 V 

(4.19) 

From (4.19) it easily follows that there exists an adapted M . d -valued process 
(X(£)) 0<ts£T in L 2 (f2, £F T , P; !R d ) such that 


limE[|X n (t)-X(t)| 2 ] = 0 . 


(4.20) 


From (4.19) it also follows that this convergence also holds P-almost surely. 
The latter can be seen as follows. Fix 77 > 0. Then the probability of the event 
{limsup^^ | X n (t) — X(t)\ > r]} can be estimated as follows: 


P 


limsup | X n (t) — X(t)| >77 


^ inf P 

meN 


^ inf P 

meN 


< inf 1 

meN 


< inf -E 

77ieN T ] 


U {\X H (t)-X(t)\>r,} 

n=m 
00 

U {|A'„.(()-V. a (f)|>i?} 

77-1 >772 ^771 

f 00 

] 2 |A'„+i(t) - A„(t)| > n 

V 71 = 777 

00 

^ | X n+1 (t) - A„(t)| 


< 


-| 00 

inf - V E[\X n+1 (t) - X n (t)\] = 0 . (4.21) 

m(=N n 


771GN n 

1 n—m 


The final equality is a consequence of Lemma 4.3 together with (4.19) and the 

inequality E[|X n+ i(t) - X n (t) |] < (E[|X n+ 1 (i) - X n (t)\ 2 ]) 1/2 . Since 77 > 0 is 
arbitrary in (4.21) we infer that lim^oo X n (t) = X(t) (P-almost surely). This 
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P-almost sure convergence (as n —► oo) also implies that we may take pointwise 
limits in (4.10) to obtain: 

b(s,X(s)) ds+ f a(s,X(s)) dB(s). (4. 

Jo 

The equality in (4.22) shows the existence of pathwise or strong solutions to the 
equation in (4.9). 




Uniqueness. Let (Xi(t)) 0 ^ t<T and (X 2 (t)) 0 ^ t ^ T be two solutions to the stochas¬ 
tic differential equation in (4.9). By using a stopping time argument we may 
assume that sup 0 ^ s ^ T |X 2 (s) — Xi(s)| is P-almost surely bounded. Then 


X 2 (t)-X 1 (t) = f (b&Xtisfi+b&X^s))) ds 
Jo 

+ f {a(s,X 2 {s))-a(s,X 1 (s))) dB(s). (4.23) 

Jo 

As in the proof of (4.15) with j = n and (4.18) it then follows that 

(SA(p)Vff 1 


[|X 2 (i)-A' 1 (i)| 2 ] < 2 “ f 

Jo 


< 2 n sup E[|X 2 (s) -X!(s)| 2 ] 

0 <s<t 


o (n-1)! 

( Sl K {pf d P ) 


n! 


AT(s) 2 E[|A 2 (s) -X!(s)| 2 ] ds 

n 

(4.24) 


Since the right-hand side of (4.24) tends to 0 as n —*■ co we see that X 2 (t) = Xi(t) 
P-almost surely. So uniqueness follows. 


The proof of Theorem 4.4 is complete now. 


□ 


1.2. A martingale characterization of Brownian motion. The fol¬ 
lowing result we owe to Levy. 

4.5. Theorem. Let (12, T, P) be a probability space with filtration (or reference 
system) (9y) t>0 . Suppose “J is the a-algebra generated by augmented with 

the P -zero sets, and suppose “J t is continuous from the right: “J t = n s>t T s for all 
t ^ 0. Let {M(t) = (Mi(t),..., Md(t )) : t 5= 0} be an -valued local P -almost 
surely continuous martingale with the property that the quadratic covariation 
processes t > (Mj, Mf) (t) satisfy 

(Mi, Mj) (; t ) = S itj t, 1 ^ i, j d. (4.25) 

Then {M(t) : t ^ 0} is d-dimensional Brownian motion with initial distribution 
given by p(B) = P [M(0) e B], B e the Borel field ofM. d . 

It follows that the finite-dimensional distributions of the process t > M(t) are 
given by: 

P [M (t\) e Bi,... ,M ( t n ) e B n ] 

[ P0,d (fn tn —1 j 3'ri— 1 > X n ) " ‘ ‘ P0,d ( t 2 t\ , X \, X 2 ) Po,d (^1 j X, Xi) 

B n 
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dx n ■ ■ ■ dx\^j dfj,(x). 

Here p 0 ,<2 (t,x,y) is the classical Gaussian kernel: 

p ° At ' x ’ v) = J^ exp { Jl ^)' (4 ' 26) 

4.6. Remark. There is even a nicer result which says the following. Let X be 
a continuous Revalued process with stationary independent increments. Then, 
there exist unique b e and E e such that X (t)—X (0) is a (6, E)-Brownian 
motion. This means that X(t) is a Gaussian (or multivariate normal) vector 
such that E [X (t)] = bt and 

® [(^ii(^) — (. Xj 2 (t ) — bj 2 t)\ = tT,j 1 j 2 . 

For the one-dimensional case the reader is referred to Breiman [29]. For the 
higher dimensional case, see, e.g., Lowther [89]. 
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Proof of Theorem 4.5. Let £ e be arbitrary. First we show that it 
suffices to establish the equality: 

E [ e -h^(t)-M( s )> | = e -i|£| 2 h-s) ; t > s > o (4.27) 

For suppose that (4.27) is true for all £ e W :l . Observe that (4.27) implies 
E = e — 2 1^1 ( t-s ). Then, by standard approximation arguments, 

it follows that the variable M(t) — M(s ) is P-independent of 37,. . In other words 
the process t > M{t) possesses independent increments. Since the Fourier 
transform of the function y p 0 ,d (t — s, 0, y) is given by 

f e-^’ripoj (t - s, 0, y) dy = e~^\ 2 d-s) 

JR d 

it also follows that the distribution of M(t) — M(s ) is given by 

P [M{t] - M(s) e B] = f p 0 ,d (t — s, 0, y) dy. (4.28) 

Jb 

Moreover, for 0 < t\ < ■ ■ ■ < t n we also have 


P [M( 0) e Bq, M (t\) — M (0) e B x ,..., M ( t n ) — M (t n ~ 1 ) e B n ] 

= P [M(0) e B 0 ] P [M (t x ) - M( 0) g Bi] • • • P [M (t n ) - M (t n ^) e B n ] 


••• Po4(t 1 ,0,y 1 )---p O4 (t n -t n _ 1 ,0,y n )dy(y o )dy 1 ---dy n . 

Bq J Bi J B n 

Hence, if B is a Borel subset of 


Here B 0 , ..., B n are Borel subsets of 
M. d x • • • x R d , then it follows that 


-V- 

n+ltimes 


P [(MO0), M (h) - M( 0),..., M (t n ) - M (* n _i)) g B] 

fa PM (t ” - Vn) * w dVl '" dy "' (4 ' 29) 

B 

Next we compute the joint distribution of (M(0), M (: t x ),..., M (■ t n )) by em¬ 
ploying (4.29). Define the linear map £ : x • • • x —>■ x • • • x by 

£ (xq j 15 * * *) xy) (xo, xq Xo1^2 xq5..., x^ x:^—1) * 

Let 5 be a Borel subset of M. d x • • • xl J By (4.29) we get 


F[(M(0),...,M(t n ))eB] 

= F[£(M(0),...,M(t n ))e£(B)] 

= P [(M(0), M (B) - M(0),..., M (t n ) - M (*„_,)) g £ ( B )] 


J • • • Jpoy (B, 0, 2 / 1 ) • • • P(ld (tn - t n - 1, 


<(B) 


0, y n ) dy ( y 0 ) eh/i • • • <% n 


(change of variables: (y 0 , y u ..., y n ) = £ (x 0 , xq,..., x n )) 


= J • • • Jpo,d (ti,x 0 , xi) • • • p 0 ,d (4 - t n —i, x n —\, x n ) dy (x 0 ) dxq • • • dx n . (4.30) 

B 


251 


Download free eBooks at bookboon.com 




Advanced stochastic processes: Part II 


Stochastic differential equations 


In order to complete the proof of Theorem 4.5 from equality (4.30) it follows 
that it is sufficient to establish the equality in (4.27). Therefore, fix £ e and 
t > s ^ 0. An application of Ito’s lemma to the function x yields 

_ e -i(£,MOO> 


= —l 


2ft fe-^dM A r)-l 2 ft& 

j = l Z J,fc=l 

(formula (4.25)) 


d r t 

f 

n—1 JS 


e -i«,M(r)> dM .( T )_ 1 |^| 2 J 


dT . 


(4.31) 


j = l S 

Hence, from (4.31) it follows that 

d 

= —i 

3 

Since the processes 


(4.32) 


d nt 

1 ^ ^ j 

7 = 1 Js 


e -i«,M(r)-M( a )> dMj .( r ) _ I |£|2 f 

2 Js 


e -i(^,M(r)-M(s)> rfr _ 


f i * f e -i(ZM(T)-M(s)) dMj {r) i t>s, 1^3 ^ d, 

J s 

are local martingales, we infer by (possibly) using a stopping time argument 
that 

E | = l _ I |£| 2 £ [ e -*«,M(r)-M(s)> | grj (4.33) 

Next, let v(t), t ^ s, be given by 

v(t) = f E [ e -h^M(r)-M(s)> | rfr _ 

Js 

Then u(s) = 0, and (4.33) implies 


^(t) + i |£| 2 n(*) = I- 


From (4.34) we infer 

j t (e5<‘-»' £ '’„(*)) - Q |ft%(i) + v'(t)\ e5<‘->' £ l 2 = 


(4.34) 


(4.35) 


The equality in (4.35) implies: 


and thus we see 

v\t) + In 

Since v(s) = 0 (4.36) results in 


2 (e^“ s)l?|2 - lj , 


T = e -|(t- s )ici 2 

(4.36) 

v'(t) = 

(4.37) 
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The equality in (4.37) is the same as the one in (4.27). By the above arguments 
this completes the proof of Theorem 4.5. □ 

As a corollary to Theorem 4.5 we get the following result due to Levy. 

4.7. Corollary. Let { M(t ) : t ^ 0} be a continuous local martingale in M such 
that the process t *—» M(t) 2 — t is a local martingale as well. Then the process 
{M(t) : t ^ 0} is a Brownian motion with initial distribution given by i-i(B) = 
P [M(0) e B], Be B t . 

Proof. Since M(t ) 2 — t is a local martingale, it follows that the quadratic 
variation process t >—> (M, M) (; t ) satisfies (M, M) (; t ) = t, t ^ 0. So the result 
in Corollary 4.7 follows from Theorem 4.5. □ 


The following result contains a d-dimensional version of Corollary 4.7. 

4.8. Theorem. Let { M(t ) = (Mi(£),..., : t ^ 0} be a continuous local 

martingale with covariation process given by 

(Mj,M k ) (t) = f ® jtk ( s ) ds , 1 A j, k P d '. (4.38) 

Jo 

Let the d' x d-matrix process (%(f) : t ^ 0} be such that x(t)$(f)x(f)* = I, 
where I is the d x d identity matrix. Put B(t ) = \ Q x ( s ) dM(s). This integral 
should be interpreted in ltd sense. Then the process t >—► B(t) is d-dimensional 
Brownian motion. Put T (t) = <b(f)x(f)*, and suppose that T(t)%(t) = I, the 
d! x d! identity matrix. Then M{t) — M( 0) = T(s) dB(s). 

4.9. Remark. Since 

x(t) ($(t)x(t)*x(t) -1) = ~ I) x(t) = 0 

we see that the second equality in 4/(f)y(f) = &(t)x(t)*x(t) = / is only possible 
if we assume d = d!. Of course here we take the dimensions of the null and 
range space of the matrix x(t) into account. 


Proof of Theorem 4.8. Fix 1 i, j ^ d. We shall calculate the qua¬ 
dratic covariation process 

( Bi. Bj) (f) - /g J)‘ 1 (*(«))„, m(s), t [ l ' W(S)) (t) 


ftiJo 


d! d! r t 
k=ll=l 


(x( s ))*A (x(s)) u ^(s) kjl ds 


3,1 


f (x(s)^(s)x(s)*) iJ ds = tdij. 

Jo 


(4.39) 


From Theorem 4.5 and (4.39) we see that the process t i—► B(t) is a Brownian 
motion. This proves the first part of Theorem 4.8. Next we calculate 

f V(s)dB(s)= f $(s)x(s)dM(s)= f dM(s) = M(t) - M(0), (4.40) 

Jo Jo Jo 
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which completes the proof of Theorem 4.8. □ 

1.3. Exponential local martingales. Let t *—* N(t), 0 < t < T, be a 

continuous (local) martingale with variation process t > (TV, N) (t), 0 < t < 
T. In this subsection we discuss local martingales of the form t i—»• e~ z ^ = 
1 + $o e ~ Z ^ dN(s), t ^ 0, where Z(t ) = N(t) + | {N,N) ( t ). Such processes 
are called exponential local martingales. The following proposition serves as a 
preparation for Proposition 4.12. It also has some interest of its own. 

4.10. Proposition. Let (12, T, P) be a probability space with a filtration 
and let M = (M(t)) Q<t<T and N = ( N(t)) 0<t<T be two local martingales with 
M( 0) = iV(0) = 0. Put Z{t) = N(t) + 1 (N,N)''(t), and assume that E [e - ^] = 
1 for all 0 < t < T. Then the following assertions are true. 

(a) The process t > e~ z ^\ 0 < t < T, is a martingale; 

(b) The process t >—► e~ z ^> ( M(t ) + ( N , M) (t)) is a local martingale; 

(c) The process t >—> M(t) + (N, M) (t) is a local martingale relative to the 

filtration (3 r t) 0 <t<T su PP^ e d with the measure Qn : Tt —»■ [0,1] defined 
by Q n (A) = A e J T . 
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The measure Ojv can be called a risk neutral measure. Observe that, by asser¬ 
tion (a), Qat(^ 4) = IE whenever A belongs to % with 0 < f ^ T. Let 

r n be the stopping time defined by 

r n = inf 1 5 > 0 : | N(s a T)| + ^ (N, N) (s a T) > n 

and set Z n (t) = Z (t a r n ). Then the processes t > e~ Zn ^\ 0 ^ f ^ T, n e N, 
are martingales. It follows that E = 1, for all 0 < t < T, and for all 

n e N. By Fatou’s lemma we infer that 

lim e ~ Zn ^ < liminfE [e _Zn ^l ^ 1. (4-41) 

n —>oo J n—KX) L J 

In fact we have a stronger result. It says that an exponential local martingale 
is a submartingale. 

4.11. Theorem. Let the process t i— > e ~ z ^\ 0 < t < T, be a continuous lo¬ 
cal martingale. In general, this process is a submartingale. Consequently, if 
E [e _z ^I] = 1 for all 0 ^ t < T, then the process t <-*■ e~ z ^\ 0 ^ t ^ T, is a 
martingale. 


E [e~ m ] = E 



Proof. This result can be seen as follows. Let 0 < ti < t 2 , and choose the 
sequence of stopping times (r n ) n6N as above. Then, for A e T tl ATm , we have 


E |e- z(t2) l J = E [lim e - z ( t2AT ")l 


< 


lim inf E [< 


, — Z(t2 AT„) 


1 A ] = E[e“ z(tlATm) l,i] 


(4.42) 


From (4.42) it follows that: 

E [e“ z(t2) | T tlATm ] ^ e - z(tlATm) . (4.43) 

Since the event {r m > t \} belongs to 3y, ATm , from (4.43) we infer 
E [e- z(h> l lTm>ttl | J,,] = E [e~ z ^l [Tm>tl] | 3), AA J 

=S e- Z(,1Ar ” ) l(r„> tl | = e -z(fl, l{ Tm>tl j. (4.44) 


The first equality in (4.44) is a consequence of the fact that, if an event A 
belongs to T t ,, then A n {r m > t\} belongs to T t| ATm . In the left-hand side and 
the far right-hand side of (4.44) we let m —* co to obtain 

E [e _z ( t2 ) | dyj < P-almost surely. (4.45) 

The inequality in (4.45) shows that the process t >—> e~ z ^ is a submartingale. 
If 0 < t\ < t 2 < T, and if E [e _z ^ 2 I] = E [e - ^* 1 )], then (4.45) implies that 

E [e -2 ^ 2 -* | £T tl ] = e~ z(Ktl \ P-almost surely. (4.46) 

This completes the proof of Theorem 4.11. □ 


PROOF of 4.10. (a) An application of Ito’s formula and employing the 
equality (Z, Z) (t) = (N , N) (t) yields: 

e-m 
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r z(0) - f e~ z(p) dZ(p) + - f e~ z{p) d (Z, Z) (p) 
Jo 2 Jo 

-f 

i‘ 


= e~ z ^ 


-zip) 


= e~ z ^ - I e 


dN(p) ~|J e~ z W d (N, N) (p) + | J e~ z W d (TV, N) (p) 
~ z i p ) dN(p). (4.47) 


From the equalities in (4.47) it follows that the process t >—> e~ z ^\ 0 < t < T, is a 
local martingale. In view of the assumption that E [e -z ^] = 1 for all 0 < t < T 
it follows that the process in (a) is a genuine martingale: see Theorem 4.11. 

(b) Again we apply Ito’s lemma, now to the function (x, y) i-> e~ x y. Then we 
obtain: 

e~m ( M (t) + (TV, M) (t)) 

= - f e -z(p) ( M ( p ) + (_/v, M) (p)) dZ{p ) + f e“ zw ( dM{p) + d (TV, M) (p)) 

Jo Jo 

+ \ f e“ zw (. M(p) + (N , M) (p)) d (Z, Z) (p) 

- f e~ z{p) d (Z, M + (TV, M)) (p). (4.48) 

Jo 

By applying the equalities (Z,Z) = (N,N) and ( Z,M + (.TV, M)) = (.TV, M ) to 
the equality in (4.48) we obtain 

e~ z{t) + (TV,M) (0) 

= - f e_Z(p) (M(p) + (TV, M) (p)) dTV(p) 

JO 

- \ f e- z( ' ,) (M(p) + (N, M) (p)) d (N . AT) (p) 

+ f e~ z{p) ( dM(p) + d (TV, M) (p)) 

Jo 


+ 2 


If 

2 Jo 


f 


>0 


-Z(p) 


p) (M(p) + (IV, M) (p)) d (TV, TV) (p) - f (IV, M) (p) 

Jo 

(M(p) + (TV, M) (p)) dN(p) + f e" z(p) dM{p). (4.49) 

Jo 


Being the sum of two stochastic integrals with respect to (local) martingales 
the equality in (4.49) implies that the process in (b) is a local martingale. 

(c) By using a stopping time argument we may and do assume that the process 
t i—> M{t) + (TV, M ) (t) is bounded and so it belongs to L 1 (Q,fF r ,Q In)- Let 
0 < t\ < t 2 < T, and put 

Y {h) = E Qjv [M (t 2 ) + (TV, M) {t0 | X,] • 


256 


Download free eBooks at bookboon.com 


Advanced stochastic processes: Part II 


Stochastic differential equations 


Then the stochastic variable Y (G) is T f| -measurable and, for all bounded Im¬ 
measurable variables G we have 

E [e~ z( - T) (M (t 2 ) + (N , M) 0 1 2 )) G]= E [e~ z{T) Y (G) G] . (4.50) 

Since the process t >—> e~ z ^\ 0 < t < T, is a P martingale, the equality in (4.50) 
implies: 

E [e~ z(t2) (M (t 2 ) + (N, M) {t 2 )) G] = E [e~ z(tl) Y (G) G] . (4.51) 

From assertion (b) together with our stopping time argument we see that the 
process t > e~ z ^ (M (t) + (N , M) (t)) is a P-martingale. From (4.51) we then 
infer: 

E [e~ z{tl) (M (G) + ( N , M) (G)) G] = E [e~ z{tl) Y (G) G] (4.52) 

for all bounded T tl -measurable variables G. So finally we get, P-almost surely, 

e~ z (h) Y ( tl ) = e ~z(h) + ( tl )) ? 

and hence, 

Y (G) = M (G) + (-A, M) (fi), P-almost surely. 

This shows assertion (c) and completes the proof of Proposition 4.10. □ 
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A combination of Proposition 4.10 and Levy’s characterization of Brownian 
motion in yields the following result. 

4.12. Proposition. Let the W* -valued process s ► c(s) be an adapted pro¬ 
cess which predictable relative to Brownian motion (B(t)) t>0 . Put N(t ) = 

c(s) dB(s), and 

Z(t ) = N(t) + - (N, N) (t) = f c(s) dB(s) + - f |c(s)| 2 ds, t ^ 0. 

2 Jo 2 Jo 

Suppose that for all t > 0 the equality E [ e _z W] = 1 holds. Then the process 
(W(t)) t>Q , defined by W(t ) = B(t) + \ ) t Q c(s)ds is Brownian motion relative to 
the measure A >—» Qn(A), As T t , as defined in Proposition f.10. 


PROOF. An application of Proposition 4.10 with M(t) = Bj(t ) shows that 
the process 

Wj(t) - Bj(t) + £cj(s)ds = Bj(t) + ( t) 

= B j (t) + (N,B j )(t) 


is a local QAr-martingale. Moreover, (l-hj,, W n ) (ft) = Sj lt j 2 t. From Theorem 4.5 
we see that the process t >—► W(t) is a Q^-Brownian motion. This completes 
the proof of Proposition 4.12. □ 


It will be very convenient to introduce Hermite polynomials {hk{x)) keN , and to 
establish some of their properties. In the context of stochastic calculus they 
also play a central role. The Hermite polynomial h k (x) is defined by 

ht(x) = (—l)*^ 2 (A) (e . (4.53) 

For k e N, x e M, a > 0, we write 

H k {x,a) = a k / 2 h k 

Then we have H$(x, a) = 1, H\{x, a ) = x, H 2 (x,a) = x 2 — a, w (x, a) = x 3 — ax. 
The Hermite polynomials satisfy the following recurrence relation: 

h k+2 {x) - xh k+ i(x) + (k + T)h k {x) = 0, k ^ 0, (4.54) 

and therefore 

H k+2 (x, a) — xH k+ i(x, a) + {k + 1 )aH k (x, a) = 0, k ^ 0. (4.55) 

The equality in (4.54) can be proved by induction and the definition of h k in 
(4.53). From the definition of h k +i(x) it follows that h' k+1 (x) = xh k+ i(x) — 
h k+2 (x), and so, by (4.54) we see 

h' k+1 (x) = (k + 1 )h k (x), k>0. (4.56) 

From (4.54) and (4.56) we infer 

h k+2 (x) - xh k+ i{x) + h' k+1 (x) = 0, k>0, 
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and hence 

hk+i(x) — xhk(x ) + h' k {x) = 0, k ^ 0. (4-57) 

By differentiating the equality in (4.57) and again using (4.56) we obtain the 
following differential equation: 

h' k (x) — xh' k (x) + kh k (x ) = 0, k ^ 0. (4.58) 

In the following proposition we collect some of their properties. 


4.13. Proposition. For r,iel and a > 0 the following identities are true: 

00 T k 

“ = ( 4 - 59 ) 


1 2 


0 rx-^r 2 _ 


n k\ 

k —0 

°o b 

— r K 

k\ 


00 b 

— r K 




fc =0 


fc =0 


(4.60) 


^ 1 ^ 

—i7 fc+ i(x,a) = (k + l)74(x,a), and ~ — H k (x,a) + —H k (x,a) = 0. 

^ ra (4.61) 


PROOF. Let the sequence ( h k (x )) be such that, for all x and re C, the 

' ' keN 


equality 


holds. Then 


00 k 

^ T ‘ = 2 hhcW 


fc =0 


h k (x) = ( ^ 


= (-1)* 

= Mt). 


1 2 
e rx— 2 F 


-x 2 

= e? 


T—0 


d_ 

dr 


e -\{r-xf 


d 

dx 


) ( eHMI ) L „ - (- 1 * 1 ( 


d 

dx 


T =0 


-±X 2 

e * 


(4.62) 


(4.63) 

The equality in (4.63) implies the identity in (4.60). By a correct scaling (r v 'a 


replaces r, and —j= replaces x) the equality in (4.59) follows from (4.60) and 


a 


the definition of H k (x,a). The equalities in (4.61) follow from (4.56) and from 
(4.58) respectively. Altogether this completes the proof of Proposition 4.13. □ 


In the following proposition the process t •—» M(t), t e [0,T], is a martingale 
on the probability space (Q, T, P). Its quadratic variation process is denoted by 
* — (M, M) (t), te [0,T], 


4.14. Proposition. The following identities hold: 

H k +i (. M{t ), (M, M) (t)) f* H k ( M(s ), (M, M) (s)) 


(k + 1)! 


I 

I 


k\ 


dM(s ) 


ldM (si)... dM (sfc + i). (4.64) 


0<si<...<Sk+i<t 
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In addition, the following equalities hold as well: 
e rM(t)-±T 2 (M,M)(t) 


f 


= 1 + T e —w;-2 


tM(s) — \t 2 (M,M) (s) 


dM(s) 


= 1 + 


+ T 


£-1 k 
— T K 


z—l p 

5"i 

1 


k=\ Jo<si <t 


0<si<---<S£<t 


J ldM( Sl )...dM (s k ) 

J e TM( S1 )-^(M,M)(si) dM ( Sl ) dM fa) ( 4 . 65 ) 


m'£ i -H k {M{t),(M,M) (t)) 


k =0 


+ T 


f f e rM( Sl )-|r 2 <M,M)( Sl ) m fa) . dM fa) 

Jo<si <---<Sf<t J 


J0<si<--<S£<t 

k 


£ k 

— r K 


k =0 

+ r 


I 




rM( Sl )-|r 2 (M,M)( Sl ) _ dM fa) . . . fa) . ( 4 66 ) 


Please notice that in the equalities in (4.64) through (4.66) the order of inte¬ 
gration has to be respected: first we integrate with respect dM (si), then with 
respect to dM (s 2 ) and so on. 
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Proof. These equalities follow from Ito’s formula and the equalities in 
Proposition 4.13. Ito’s lemma is applied to the functions (x,a) Hk+i(x,a), 
and (x,a) •—» e TX ~ 2 T a with x = M(s), and a = (M,M)(s). In particular 
the equalities in (4.61) are relevant. This completes the proof of Proposition 


4.14. 


□ 


In the following proposition we collect some equalities in case we consider an 
exponential martingale t i—> e M ( t )- 2 ( M < M )(t) in case the process t >—> (M, M) (t) 
is deterministic. 


4.15. Proposition. Let t i—> M(t), 0 < t < T, be a martingale on (f2,T, P) 
with the property that the variation process t >—► (M, M) (t), 0 < t < T, is 
deterministic. The the following identities are true: 


E 


J 

Jo< 


f dM( Sl )...dM(s kl )- f \dM{ Pl ). 

0<si<-~<s kl <t J J0<p\<“-<pk 2 <t J 

= E [H k] ( M(t ), (M, M) it)) H k2 ( M(t ), (M, M) (*))] 

((M, M) (t)t 


h\ 


'k\,k2 


, and 


E 


f f e M( ai )-i(M,M>( fll ) dM dM 

J0<si<-~<S£<t J 

Jo P — !)• 


t-1 


_ e (M.M)«) _ V «V. V) (OP 


2 

i=o 


dM ( pk 2 ) 


(4.67) 


(4.68) 


Proof. Let the predictable processes s i—> T\(s) and s F 2 (s) be such that 


the quantities E 


£ |F,( S )| 2 <f(M,M) (*) 


finite. Then we have 

r 't'2 


and E 


£ |F( S )| 2 d(M,M} (s) 


are 


E 


f 2 Fi(s) dM(s) • f 2 F 2 (s) dM(s) = E f ' F 1 (s)F 2 (s) d (M, M) ( 5 ) 

Jo Jti J Uo 


t2 
Ut ! 


(' 


.69) 


for 0 ^ fi < f 2 ^ T. By repeatedly employing the equality in (4.69) and 
using the fact that the process s > (M, M) (s) is deterministic we infer, for 
1 ^ hi < k- 2 , and 0 < t ^ T, with i = k 2 ~ ki, 


E 


f f dM ( 5l ) ...dM (s kl ) • f f dM ( Pl ) ...dM ( Pk2 ) 

J0<S!<---<S kl <t J JO<01 <--<Ph~<t J 


0<P!<--<p k <t ■ 


= E 


f f dM ( Sl ) ...dM (s t ) d (M, M) (s m ) ...d(M,M) (s k2 ) 

J0<si<---<s k2 <t J 
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- E 


j j 

J 0 <si<---<S£<t J 


((M, M) (t) — (M, M) ( s e )) 
hi 


ki 


dM (si)... dM (se) 


If in (4.70) h = I 2 , and so £ = 0, then we obtain 


E 


f f dM( Sl )...dM(s kl ) 

Jo<si<---<s/ ei <t J 


«M,M) (t)) 

hi 


ki 


= 0. 

(4.70) 

(4.71) 


The equalities in (4.70) and (4.71) show the equalities in (4.67). The proof of 
the equalities requires an induction argument. For £ = 1 we have 

2 " 


E 


f 


f e M(si) -|( M - M > (si) dM (si) 

Jo 

E [ e 2M( S )-<M,M)( S )j d ^ ^ 


0 

= r e 

Jo 


,2M(s)-h2M,2M>(s) 




= f d (M, M) (s) = - 1. 

Jo 


(4.72) 


The equalities in (4.72) imply those in (4.68) for £ = 1. The second equality 
follows by partial integration and induction with respect to £. The first equality 
in (4.68) can be obtained by an argument which is very similar to the proof of 
the equality in (4.67) with h = h = £. The details are left to the reader. 


This completes the proof of Proposition 4.15. 


□ 


4.16. Corollary. Let the hypotheses and notation be as in Proposition 4-H- 
Then 

lim T e f [ e rM{. S1 )-\rpM,M){si) dM dM = 0 , (4.73) 

£—>°° Jo<si< - <se<t J 


P-almost surely. If the limit in (4-73) is in fact an L 1 -limit, then the pro¬ 
cess t i—> e TM d)- 2 T < M ’ M >(d is a martingale. In particular, it then follows that 


E 


d tM (t) — | t 2 (. M,M) (t) 


orem 4-11- 


1; compare with the inequality in (4-4V an d with The- 


If the process t > (M, M) (t) is real-valued and deterministic, then the limit in 
(4-73) is an L 2 -limit, and so also an L 1 -limit. 


Proof. Equality (4.73) in Corollary 4.16 follows from the equality in (4.59) 
in Proposition 4.13 with x = M(t) and a = (M, M) (t) together with the equal¬ 
ities in (4.65) and (4.61). The assertion about the Id-convergence also follows 
from these arguments. The only topic that requires some is the one about 
the situation where the process t >—* (M, M) (t) is deterministic. In this case 
the terms in the sum in (4.65) are orthogonal in L 2 (Q, Tf. P), and this sum 
converges in L 2 -sense to e TM ( t )- 2 ' r2 ( M ’ M )( t )_ These assertions follow from the 
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identities (4.67) and (4.68) in Proposition 4.15. This completes the proof of 
Corollary 4.16. □ 

The previous results, i.e. Proposition 4.14 and Corollary 4.16 are applicable if 
the martingale M(t) is of the form M(t) = ^ h(s) • dW(s ), where t i-> W{t) 
is standard Brownian motion. Then (M, M) (t) = \h(s)\ 2 ds. If s h(s) is 

deterministic, then in (4.73) we have L 2 -convergence. These martingales play a 
role in the martingale representation theorem: see Theorem 4.21. 

1.4. Weak solutions to stochastic differential equations. In the fol¬ 
lowing theorem the symbols and bj, 1 ^ i, j ^ d ) stand for real-valued 
locally bounded Borel measurable functions defined on [0, go) xR d . The matrix 
(a*j(s, x))*j =1 is defined by 

d 

a j,k( s , x ) = 2 a i:k (s,x)a jtk (s,x) = (a(s, x)cr* (s, x)) id . 

k =l 

For s ^ 0, the operator L(s) is defined on C 2 (R d ) with values in the space of 
locally bounded Borel measurable functions: 

-id d 

L(s)f(x) = - J] a iJ (s,x)D l D j f(x) + J]b j (s,x)D j f(x), f e C 2 (R d ) . 

hj =1 3 = 1 

(4.74) 
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The following theorem shows the close relationship between weak solutions and 
solutions to the martingale problem. 

4.17. Theorem. Let (f2,£F, P) be a probability space with a right-continuous 
filtration (3y) t>0 . Let { X(t ) = (X\(t),... ,Xfit )) : t ^ 0} be a d-dimensional 
continuous adapted process. Then the following assertions are equivalent: 


(i) For every f e C 2 (]R d ) the process 

t~f(X{t))-f (A'(0)) - f L{s)f (A(s)) is (4.75) 

Jo 

is a local martingale. 

(ii) The processes 

1 1 —► Mj(t) := Xj(t ) — f bj (s,X(s)) ds, t ^ 0, 1 ^ j < d, (4.76) 

Jo 

are local martingales with covariation processes 

t ^ (M { , Mj ) ( t) = f a itj (s,X(s)) ds, t^ 0, 1 i, j < d. (4.77) 

Jo 

(iii) On an extended probability space (12 x 12', “J t ® H' t , P x P') there exists 
a Brownian motion { B(t ) : t ^ 0} starting at 0 such that 


X(t) = X(0)+ f b(s,X(s)) ds+ f a(s,X(s)) dB(s), t> 0. (4.78) 

Jo Jo 


Here (Lt',H' t , P') is an independent copy of (12, T t , P). Moreover, the equality in 
(f.78) implies that the stochastic integral (co, co / ) h-»• ^cr(s,X(s)) dB(s) 
is P x P '-independent of uj'. If the matrix a (s, y ) is invertible, then there is no 
need for this extension. 


Examples of (Feller) semigroups can be manufactured by taking a continuous 
function <p : [0, go) x E —> E with the property that cp (s + t, x) = p ( t, p (s, x )), 
for all s, t ^ 0 and x e E. Then the mappings / >—► P(t)f, with P(t)f(x) = 
f (p (£, x)) defines a semigroup. It is a Feller semigroup if lim x _>A T Y , x ) = A. 
An explicit example of such a function, which does not provide a Feller-Dynkin 

X 

semigroup on Co (M) is given by p(t, x) = — . : (example due to V. 

Y 1 + l tx2 


[71]). Put u(t,x ) = P(t)f(x) = f (p(t,x)). Then 
In fact this (counter-)example shows that solutions 


Kolokoltsov [72], and 
du ndu 

- {t>x) = -x 

to the martingale problem do not necessarily give rise to Feller-Dynkin semi¬ 
groups. These are semigroups which preserve not only the continuity, but also 
the fact that functions which tend to zero at A are mapped to functions with 
the same property. However, for Feller semigroups we only require that con¬ 
tinuous functions with values in [0,1] are mapped to continuous functions with 
the same properties. Therefore, it is not needed to include a hypothesis like 
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(4.79) which reads as follows: for every ( r,s,t,x ) e [0,T] 3 x E, t < s < t, the 
following equality holds: 


P T;X [X(t) e E] = P T , X [X(t) e E, X(s) e E] . (4.79) 


Nadir ashvili [99] constructs an elliptic operator in a bounded open domain 
U c with a regular boundary such that the martingale problem is not 
uniquely solvable. More precisely the result reads as follows. Consider an elliptic 

d d 2 

operator L = ) a 2 ,, — —-— , where a. u = a ik are measurable functions on W l 

dx i dx * 


such that 


d 

c _1 l«l 2 ss 2 c|?| 2 , {6 nr 1 , 

3,k =1 


for some ellipticity constant c ^ 1. There exists a diffusion (X(t),P x ) corre¬ 
sponding to the operator L which can be defined as a solution to the martin¬ 
gale problem P[X(0) = x] = 1, f(X(t)) - f (X( 0)) - £/(X(s)) ds is a P x - 
martingale for all / 6 C 2 (M d ). Nadirashvili is interested in non-uniqueness in 
the above martingale problem and in non-uniqueness of solutions to the Dirich- 
let problem Lu = 0 in Cl, the unit ball in u = g on dCl, where fl a 
is a bounded domain with smooth boundary and g e C 2 (dil). In particular, 
so-called good solutions u to the Dirichlet problem are investigated. A good 

solution is a function u which is the limit of a subsequence of solutions u n , 

d 

n e N, to the equation L n u n = } ah-—= 0 in fl, u n = g on dQ, where 

,-t^i dxjdik 

the operators L n are elliptic with smooth coefficients a'jj, and a common el- 
lipticity constant c such that a^ k —* aj t k almost everywhere in O as n -» co. 
The main result is the following theorem: There exists an elliptic operator L 
of the above form defined in the unit ball B\ cz M d , d ^ 3, and there is a 
function g e C 2 (c B i ) such that the formulated Dirichlet problem has at least 
two good solutions. An immediate consequence is non-uniqueness of solutions 
to the corresponding martingale problem. 


The following corollary easily follows from Theorem 4.17. It establishes a close 
relationship between unique weak solutions to stochastic differential equations 
and unique solutions to the martingale problem. For the precise notion of 
“unique weak solutions” see Definition 4.19 below. This result should also be 
compared with Proposition 3.43, where the connection with (strong) Markov 
processes is explained. 

4.18. Corollary. Let the notation and hypotheses be as in Theorem 4-17. Put 
D = C ([0, go), M d ), and X(t)(u ) = oj(t), t 5= 0, u> e D. Fix x e E. Then the 
following assertions are equivalent: 

(i) There exists a unique probability measure P on T such that the process 

(A'(0)) - f L(s)f (X(s)) ds 

Jo 
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is a P -martingale for all C 2 -functions f with compact support, and such 
that P [W(0) = x\ = 1. 

(ii) The stochastic integral equation 

a (s,X(s)) dB(s) + f b(s,X(s))ds (4.80) 

Jo 

has unique weak solutions. 

4.19. Definition. The equation in (4.80) is said to have unique weak solutions 
on the interval [0,T], also called unique distributional solutions, provided that 
the finite-dimensional distributions of the process X(t), < t < T, which satisfy 

(4.80) do not depend on the particular Brownian motion B(t) which occurs in 

(4.80) . This is the case if and only if for any pair of Brownian motions 

{(£(*) : T>t ^ 0),(D,T,P)} and {(B'(t) : T > t ^ 0), (Q\ T, P')} 

and any pair of adapted processes {X(t) : T ^ t ^ 0} and {X'(t) : T ^ t ^ 0} 
for which 

X(t) = x + f cr (s, X(s)) dB(s) + f b(s,X(s))ds and 

Jo Jo 

X'(t) = x+ f a(s,X'(s))dB'(s) + f b(s,X'(s)) ds 

Jo Jo 

the finite-dimensional distributions of the process {X(t) : T ^ t ^ 0} relative 
to P coincide with the finite-dimensional distributions of {X'(t) : T ^ t ^ 0} 
relative to P'. 
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Proof of Theorem 4.17. (i) => (ii) With f 3 (x 1; ...,x d ) = x j: 1 < j < 
d, assertion (i) implies that the process 

M j (t) = X j (t)~ [’b j (s,X(s))ds = \ L{s)f j (X(s))ds (4.81) 

Jo Jo 

is a local martingale. We will show that the processes 

| Mi{t)Mj{t) - J a id (s,X(s)) ds : t ^ o|, 1 i, j < d, 

are local martingales as well. To this end fix 1 ^ i, j < d, and define the 
function fij : M. d —► M by fij (xi,... , = X{Xj. From (i) it follows that the 
process 

- £ (dij (s, X(s)) + bi (s, X(s)) Xj(s) + bj (s, X(s)) Xi(s)) dsj 

is a local martingale. For brevity we write 

= a i,j 5 = bj ( 5 ,^( 5 )) , (s) = bi (<$, Jf(s)) , 

Mi(s) = Xi(s) - f fair) dr, M 3 (s) = X^s) - f /3j(r) dr, 

Jo Jo 

MiM = XMXjis) - f W,(t)Xj(t) + ft(r)X.(r) + c,j(t)) dr. (4.82) 

Jo 

Then the processes Mi and M i: j are local martingales. Moreover, we have 

Mi{t) + £ Pi{s)d^j 

: f (A(f)V'( t ) + Pj( r ) X i( T ) + a «,i( T )) dr + M id (t) 

Jo 

: f (AO) (Xj(r) - Mj(r)) + AO) (V(r) - M<(r)) + a<j(r)) dr 

Jo 

+ f (AO)V'O) + /3j(T)Mi(T)) dr + Mjj(t) 

Jo 

: f A(t) (V(t) - Mj(r)) dr + ( AO) (V(r) - M^r)) dr 

Jo Jo 

+ f « i;i (r)dr+ f (AO)-VO) + A 0)^*0)) dr + 

Jo Jo 

: f AO) f Pj(s)dsdT- j- f fdj (r) f /3i(s)dsdr 

Jo Jo Jo Jo 

+ f aij(r)dr+ f (AO)ViO) + dr + 

Jo Jo 

: J J AO) AO dT ds + J J AO)A( s ) dr ds 


0 <S<T<t 


0 <T<S<t 


+ f « i;i (r)dr+ f (AO)-VO) + AO)-VO)) dr + M id (t) 

Jo Jo 
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= f A( r ) dr f /3j(s) ds + f otij(s) ds + 

Jo Jo Jo 

+ f (/3 i (s)M j (s) + /3j(s)Mi(s)) ds. (4.83) 

Jo 

Consequently, from (4.83) we see 

Mi(t)Mj(t) — f aij(s)ds 

Jo 

= - f (0i(s) (Mj(t) - Mj(s)) + 0j(s) - Mi(s))) ds. (4.84) 

Jo 

It is readily verified that the processes 

f A(s) (Mj(t) - Mj(s )) ds and f ^(s) (M^t) - M^s)) ds 

Jo Jo 

are local martingales. It follows that the process 

Mi(t)Mj(t ) — J* Oiij(s) ds : t ^ 0 


is a local martingale. So that the covariation process (Mj, Mj) is given by 
(Mi, Mj) ( t ) = a i,j( s )ds. 

(ii) => (iii) This implication follows from an application of Theorem 4.8 with 
Tj j (t) = a,ij (t, X(t)), and %(£) = a (t, X(t)) -1 . If the matrix process a [t, X(t)) 
is not invertible we proceed as follows. First we choose a Brownian motion 
(B'(t)) t>0 which is independent of (f2,£F t) P) and which lives on the probability 
space (O', 3^, P'). The probability spaces (0,T t ,P) and (O'jT^P') are coupled 
by employing a standard extension of the original probability space (f!,3y,P). 

This extension is denoted by T t , pY where Q = Q x O', T t = T f (x) Tj, and 

P = P x P'. Finally, B' = B’ ( t,u> /), t ^ 0, (co,lu') e Q x Cl'. We have a 

martingale M(s), 0 < s < t, on (f2, £F t ,P) with the properties of assertion (ii). 
We introduce the matrix processes ^ e (s ), e > 0, E R (s), and E^(s) as follows 


4>e(s) = cr* (s,X(s)) (cr(s,X(s))cr* (s,X(s)) + el) 1 
E r (s) = lim a* (s, X(s)) (a (s, X(s)) a* (s, X(s)) + el)~ l a (s, X(s )), and 

En(s) = I — E r (s). 


The matrix E R (s ) can be considered as an orthogonal projection on the range 
of the matrix a* (s,X(s)) a (s, X(s)), and E N (s as an orthogonal projection on 
its null space. More precisely, 


E R (s)a*(s,X(s)) = a*(s,X(s)), and a (s, X(s)) E N (s) = 0. 
In terms of these processes we define the following process: 


B(s) = lim 

ejO 


f 4(r)JM(r)+ f E n (t) dB'(r). (4.85) 

Jo Jo 
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Next we will prove that the process s i—► B(s) is a Brownian motion, and that 
M(s) = ^cr(r, X(t)) dB(r). Put 


Then we have: 


B e (s)= f^(r)dM(r)+ f E n {t) dB\r). 

Jo Jo 


(4.86) 


(B £ ji , B £ j 2 ) ( 5 ) 


d rs ^ ^ 

2 $e,ji,ki (r)A, jlM (■ r)a kl/ (r, X(r)) a k2/ (r, X(r)) dr 

fci,fc 2 /=i 


+ 2 ( T ) E NJ2^i ( T ) d ( M *i> 5 fc) ( T ) 

fc=l 


^ r s ^ 

+ 2 ( r ) £; Ar,ii,Ki (t) d (M kl , P^} (r) 

fc=i Jo 


d r»s 

+ 2 E N,j u k(r)E Nj2tk (T)dT 

k =1 J o 

(the processes M and I?' are P-independent) 

= f dr 

Jo v / hJ 2 

+ f (£ n (t)EJ(t)) dr. (4.87) 

Jo 
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From (4.87) we infer by continuity and the definition of Er(t) that 
( B ji, B h) ( s ) = Km (B Stjl ,B £j2 ) (s) 

- f (Er(t)E> r (t)) dr + f (E N (T)E* K (r)) nJ2 dr 

Jo Jo 

= f (E R (r)E’ R (r) + E N (r)E’ N (r)) dr 

Jo 

(the processes Er(t ) and En(t) are orthogonal projections such that Er{t) + 
E n (t) = I) 


- <W- (4.88) 

From Levy’s theorem 4.5 it follows that the process s > B(s), 0 < s ^ t, is a 

Brownian motion. In order to finish the proof of the implication (ii) => (iii) 

we still have to prove the equality M(s) = ^ a (r, X(r)) dB{r). For brevity we 
write <j(t) = a (r, X(r)). Then by definition and standard calculations with 
martingales we obtain: 

M(s) — f a(r) dB e (r) = M(s) - f a(r)xp £ (r) dM( t) - f a(r) E n (t) dB'(r) 

Jo Jo Jo 

= f (l — ct(t)ct*(t) (<j(t)<j*(t) + el) -1 ) dM(r) 

Jo 

= ef (<j(r)cr*(r) + el)~ l dM(r). (4.89) 

Jo 

From (4.89) together with the fact that covariation process of the local martin¬ 
gale M(s) is given by a(r)a*(r) dr, it follows that the covariation matrix of 
the local martingale 


M(s) - f cr (r) dB £ {r) 

Jo 


is given by 

eT 


In addition, we have in spectral sense: 

0 ^ e 2 (d(r)cr*(r) + el) -1 cr(r)cr*i 
and thus in L 2 -sense we have 


(t) + el) 1 dr. 

(4.90) 

t) + eiy 1 < 11, 

(4.91) 

cr(r)B £ (r)) = 0. 

(4.92) 


The equality in (4.92) completes the proof of the implication (ii) —> (iii). 


(iii) =^> (i) Let / : W ! —> E be a twice continuously differentiable function. By 
Ito’s lemma we get 

(A'(0)) - f L(s)f (X(s)) ds 

Jo 
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= f V/ (X(s)) ■ dX( s ) + \ 2 D,D j f(X( s ))d{X„X J )( s ) 

JO - ij =1 JO 

- \ L(s)f(X(s))ds 

Jo 


d r t 


= S ^(s,X( s ))D,/(X( s ))d s 

3-1 Jo 

2 r <Ji,k {s, X(s)) a jtk (s, A(s)) DiDjf (X(s)) ds 

i,j = 1 k =1 

+ f Wf(X(s))a(s,X(s))dB(s)- f L(s)F (X(s)) ds 

Jo Jo 

= f V/ (X(s)) a (s, X(s)) dB(s). (4.93) 

Jo 

The final expression in (4.93) is a local martingale. Hence (iii) implies (i). 

This completes the proof of Theorem 4.8. □ 

4.20. Remark. The implication (ii) ==^> (i) in Theorem 4.17 can also be proved 
directly by using Ito calculus. Suppose that the local martingales t i-» Mj (t ), 
1 < j ^ d, are defined as in assertion (ii) with covariation processes as in (4.77). 
Let / be a C 2 - function defined on WL d . Then we have: 

/ (X(t)) -f(X(0»- f L(s)f (X(s» ds 

Jo 

- fv/pfMJAYM + i 2 ( DiDjf (X(s)) diXi.XJis) 

Jo ^ ij =1 Jo 

- fL(s)/(X(s)) ds 

Jo 

= f V/(X(s)) JM(s)+ f Vf(X(s))b(s,X(s)) ds 

Jo Jo 

■ V f (X(s)) d (Mi, Mj) (s) — f L(s)f(X(s)) ds 

i,j= l Jo Jo 

= fv/(X(s)) JM(s)+ f Xf(X(s)) b(s,X(s)) ds 

Jo Jo 

+ \ Y. f A-Dj/ (*(«)) a,ij (s, X(s)) ds - f L(s)/ (X(s)) ds 

Z i,j=i J° J° 

= f V/(X(s)) dM(s). (4.94) 

Jo 

Assertion (i) is a consequence of equality (4.94). 


1 

+ 2 
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2. A martingale representation theorem 

In this section we formulate and prove the martingale theorem based on an 
n-dimensional Brownian motion. Proofs are, essentially speaking, taken from 
[106]. Let (VL(s)) 0 ^ s<oo be standard Brownian motion in K n , and let be 
the cr-field generated by ( W(s)) Q ^ s ^ t augmented with the P-null sets. For h e 

L 00 ([0,T];M n ) we write Xh(t) := e L^( s )'^( s )~iLI^( s )l ds . 

4.21. Theorem. Let be the subspace of (f2,SF r ,P) spanned by the expo- 
nentials X h {T) := e SlKs)-dW{s)-\\l\h{s)\ 2 ds^ h G L^ mple ([0,T];R n ). Then is 
dense in the space L 2 (f2, £F T) P). 

In Theorem 4.21 the space LT mple ([0,T];M n ) consists of those M n -valued func¬ 
tions h e L 00 ([0, T]; R n ) which can be written in the form 

N / N \ N 

H*) - 2 ( XA = 2 V o « s < T, N € N, (4.95) 

k =1 \j=k ) j=1 

where, for any N e N, 0 = to < ti < ■ ■ ■ < = T is an arbitrary partition of 

the interval [0,T], and where (A/) 1<J</V are arbitrary vectors in M n . Observe 

that, for such functions h, h(s ) • dW(s) = Yjj=i \ ' W ( tj ). Also notice that, 

by Ito’s lemma, X h (T) = 1 + ^X h (s)h(s) ■ dW(s ), h e L 00 ([0, T] ; M n ). 


> Apply now 
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In the proof of Theorem 4.21 the following notation is employed. The symbol 
(7® (E' ixA: ) stands for the vector space of those C°°-functions defined on all 
real n x N matrices A with the property that all functions of the form 

A i—» (l + ||A||^ s ) m A), meN, 1 < j ^ N, 1 < k < n, a 3}k e N, 

are bounded. Here Dj 3 f ’ k stands for the derivative of order a 3 j. relative to the 
variable The symbol ||A|| HS stands for the Hilbert-Schmidt norm of the 

matrix A; that is 

N n 

II'MIhS = Xj Xj l A h fc l ) A = ■ 

j =1 k =1 

Functions of the form A >—» exp (—| ||A||^ g ) belong to the space C® (E nxJV ). 
Observe that C® (]R nxAr ) constitutes a dense subspace of Co (M nxJV ), i.e. the 
space of complex-valued continuous functions which tend to 0 at go equipped 
with the supremum norm. 


Proof of Theorem 4.21. This statement is true if there exists no g e 
L 2 (f2, £F t , P), which is perpendicular to all X(T ) e We start by assum¬ 
ing that there is a g e L 2 ( il . Tr, P) such that g is orthogonal to all vari¬ 
ables X(T) e x I>t- This orthogonality means that E [X k (T)g] = 0, for all 
h 6 LE mple ([0, T ]; M n ). Or, what is the same, 

f e S T 0 h(s)-dw(s )-1 $ 0 T h{sf d S(m = {) ^ for aU h 6 L oo mpie T ]. ( 4 . 96 ) 

Jn 

The equalities in (4.96) are equivalent to 




ds 


r e fiHs)-dw( S )(w) gd P = 0 


foralUe£» mpfe ([0,r];R“), 


which amounts to the same as 

f e l T 0 h{s)-dW{s ) gdp = 0 , fo r all h6L s °° mple ([0,T];R n ), (4.97) 

By taking h as in (4.95), we see that for all A = (Ai,..., A#) e (M. n ) N = M nxJV 
and for all (H,..., £jv) e [0, T] N with 0 = to < ti <■■■< tjy = T, the following 
equality holds: e^ =1 Xj ' w ^gdF = 0. Next, put C(A) = ^ e^ =1 Xj ' w ^g dF. 

The function A >—► C(A) is real analytic on M nxJV , and thus has an analytic 
extension to the complex space C nxN : G(z ) := e^^ lZrW ^g dF for all z e 
C nxN . Here z 3 ■ Wj[t) = Ylk=i z j,kW k {t), z 3 = (z hk ,..., z n , k ) e C n . Since 
G(A) = 0 for A e M nxJV , it follows that G(z) = 0 for z e C nxJv . However, for 
c p 6 C® (M nxJV ), and with 
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where (yi,..., y ^) e M nxAr , we see that 

E [<p(W(t 1 ),...,W(t N ))g] = f <p(W(t 1 ),...,W(t N ))gdP 

Jn 

(inverse Fourier transform) 

= f (pTu f e lT ‘j=i w ( t 3>yj{p(y)dy) gdP 

JQ \{ Z7r J JR n / 

(Fubini’s theorem) 

f g dFfi(y) dy 

M n 


1 

~~ (2yr) n 

From the monotone class theorem, and the fact the space (7® (E nxAr ) is dense 
in Co (R n xn ) for the uniform topology, it follows that the equality in (4.98), i.e. 
the equality 

E[<p(W(t 1 ),...,W(t N ))g] = 0 (4.99) 

can only be true for all cp e (7® (R nxJV ), and for all (ti,... ,tpj) e (0, oo)^, 
0 < ti < ■ ■ ■ < = T, for all N e N, provided that E [Fg] = 0 for all 

bounded 3V-measurable random variables F. Consequently, E [X k (T)g] = 0 for 
all h e L® liple ([0, T]; R n ) if and only if the random variable g e L 2 (f2,TV, P) is 
identically 0. This completes the proof of Theorem 4.21. □ 


G{iy)0{y) dy = 0. 


(4.98) 



The following theorem is known as the Ito representation theorem. 

4.22. Theorem. If the random variable X(T) belongs to L 2 (f2,3V,P), then 
there exists a unique predictable R n -valued process t >—► F(t), 0 < t < T, for 
which ^E [|F(s)| 2 ] ds < go and which is such that 

X(T) = E [X(T)] + f F(s) ■ dW(s). (4.100) 

Jo 


In other words the space 


C+|J F(s)-dW(s) : s^F(s) 
coincides with L 2 (Q, 9V, E). 


predictable and 


f EpfMI 2 ] 

Jo 


ds < oo 


Proof of Theorem 4.22. Let X(T) be as in Theorem 4.22. Then there 

of elements in 


ke N 


exists double sequences ^(aq,fc)y!fi) in C and ^(%fc)^2u^ 

^simple ([0,T] ;E n ) such that, with F k (t) = Xji <Xj,kX hjtk (t)h jik (t) and with 
N k N k 

X k (T) := 2 a jtk X h . tk (T) = £ a jtk e^ 


3 = 1 


3 = 1 
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N k nT N k 

= Y + Yi a h kX F, k Whkit) • dW (0 

j =1 3 = 1 

= E [X k (T)]+ f F k (t)-dW(t), 

Jo 


we have 

0 = lim E [I X(T) - X k (T )| 2 ] = lim E [| X e (T) - X k (T )| 2 ] 

/c—► oo k/^co 

= ^ ||E pQ(T) - X fc (T)]| 2 + £e [|F,(0 - F k (t )| 2 ] d*j . 


(4.101) 


(4.102) 


From (4.102) we infer that E[X(T)] = lim^oo E [X k (T)] and that there exists 
a predictable process t h-> F(t) such that 

r T 

lim E [\F(t) — F k (t)\ 2 ] = 0. (4.103) 

fc—>co Jq 

From (4.103) we obtain 

L 2 - lim f F k (t) ■ dW(t) = 

k^co Jo 

A combination of (4.101), (4.102) and (4.104) yields the equality in (4.100). In 
addition, we have ^E[|F(s)| 2 ] ds < go, and so the existence part in Theo¬ 
rem 4.22 has been established now. The uniqueness part follows from the Ito 
isometry 


f T 1 

|F 2 ( S )-F 1 (s)| 2 ds 

= E 

• rT 2-1 

(F 2 (s)-F 1 (s))-dW(s) 

_Jo 


Jo 


if X(T) - E[A(T)] = Jo Fi(s) • dW(s) = ^ F 2 (s ) • dW(s). Altogether this 
completes the proof of Theorem 4.22. □ 


I 

Jo 


F(t) -dW(t). 


(4.104) 


Next we formulate and prove the martingale representation theorem. 

4.23. Theorem. Let (M(t)) 0 ^ t ^ T belong to M 2 (Q, TV, P). Then there exists 
a unique predictable W 1 -valued process t >—» ((t) = , ■ ■ ■, Cn{t)) , C(t) e 

L 2 (Q, T t , P; M n ), such that 

rt n rt 

M(t) = M( 0) + COO • dW(s) = M( 0) + V Ci(s) dWj(s). (4.105) 
Jo j =1 Jo 

Of course, if in Theorem 4.23 the process t >-*■ M(t), 0 ^ t ^ T, is a martingale 
vector in then we obtain a predictable matrix Z(t) e M. nxd such that M{t) = 
M( 0) + S Z(s)* dW(s). (This is what one needs in the context of Backward 
Stochastic Differential Equations or BSDEs for short.) 


Proof of Theorem 4.23. Let (M(£)) 0sgts , T be as in Theorem 4.23. The¬ 
orem 4.22 yields the existence of a unique predictable M n -valued process t >—► 
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C (t) = (Cl (t), ■ ■ ■, C n(t )), c (t) e L 2 (n, P; E n ), such that 


M(T) = 


rT n rJ 

E [. M(T )] + C(s) • dW{s) = E [. M(T )] + V 

Jo ;=1 Jo 


QMdW^s). (4.106) 


Since the processes t > M(t) and t >—> ^Cj(s) dWj(s), 1 < j < n, 0 < t < T, 
are martingales, from (4.106) we infer 


M{t) = E [M(T) | T t ] = E 


E 


[M(T)\ + f C(s)-dVF( S ) 

Jo 




= E 


[Af (T)] + f C(s) • dlT(s) = E [Af (0)] + f C(s) • dW(s) 

Jo Jo 


From (4.107) we get Af(0) = E[M(0)], and so the representation in 
follows from (4.107). The proof of Theorem 4.23 is complete now. 


(4.107) 

(4.105) 

□ 
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skills to meet the increasing need of businesses.The 
MSc programmes provide a stimulating and multi¬ 
cultural learning environment to give you the best 
platform to launch into your career 

* MSc in Business 


* MSc in Financial Economics 


* MSc in Strategic Marketing Management 


NORWEGIAN 
BUSINESS SCHOOL 


EFMD 

EQUIS 


*ffi 


* MSc in Leadership and Organisational Psychology 

www.bi.edu/master 


276 



Download free eBooks at bookboon.com 










Advanced stochastic processes: Part II 


Stochastic differential equations 


3. Girsanov transformation 


In this section we want to discuss the Cameron-Martin-Girsanov transformation 
or just Girsanov transformation. Let P) be a probability space with a 

filtration (3 r t) t>0 . In addition, let the process { B(t ) : t ^ 0} be a d-dimensional 
Brownian motion. Let bj , Cj, be Borel measurable locally bounded functions 
on W l . Suppose that the stochastic differential equation 

= x + f a (s, X(s)) dB(s) + 

Jo 

has unique weak solutions. For a precise definition of the notion of “unique 
weak solutions” see Definition 4.19. For more information on transformations 
of measures on Wiener space see e.g. Ustiinel and Zakai [139]. In particular 
these observations mean that if in equation (4.109) below (for the process Y(t)) 
the process B'(t) is a Brownian motion relative to a probability measure P', 
then the P'-distribution of the process Y(t) coincides with the P-distribution of 
the process X(t) which satisfies (4.108). Next we will elaborate on this item. 
Suppose that the process t > Y(t) satisfies the equation: 



f 


b (s,X(s)) ds 


(4.108) 


Y(t) =x + 


= x + 


fu( s ,y( s )) dB(s)+ f 

JO JO 

f a(s,Y(s))dB'(s) + f 

JO JO 


(b (s, Y(s)) + (7 (s, y (s)) c (s, Y(s))) ds 
b(s,Y(s))ds, (4.109) 


where B'(t) = B(t ) + Lc (s, Y (s)) ds. The following proposition says that rela¬ 
tive to a martingale transformation P' of the measure P (Girsanov or Cameron- 
Martin transformation) the process t >—> B'(t ) is a P'-Brownian motion. More 
precisely, we introduce the local martingale M'(t) and the corresponding mar¬ 
tingale measure P' by 

M\t) = exp J c (s, y(s)) dB(s) — ^ |c (s, y(s))| 2 ds^j and (4.110) 
r[A]=E[M'(t)l A ], Ae%. (4.111) 


We also need the process Z'(t) defined by 

z \t) = ~[ c (Y Y(s )) dB(s) - \ £ |c (s, y(s))| 2 ds. (4.112) 

In addition, we have a need for a vector function ci(t,y) satisfying c(t,y) = 
ci(t,y)a(t,y). We assume that such a vector function ci(t, y) exists. 


4.24. Proposition. Suppose that the process Y(t) satisfies the equation in 
(4-109). Let the processes M'(t ) and Z'{t) be defined by (4-110) and (4-112) 
respectively. Then the following assertions are true: 


(1) The process t >—► M'(t) is a local ¥-martingale. It is a martingale 
provided E \M'(t)] = 1 for all t ^ 0. 

(2) Fix t > 0. The variable M'(t ) only depends on the process s *—> Y(s), 
0 < s ^ t. 
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(3) Suppose that the process t *—> M'(t ) is a ¥-martingale, and not just a 
local ¥-martingale. Then P' can be considered as a probability measure 
on the a-field generated by u t> oT t . 

(4) Suppose that the process t *—> M'(t ) is a ¥-martingale. Then the process 
1 1 —> B'(t ) is a Brownian motion relative to P'. 


Proof. 1 From Ito calculus we get 

M\t)-M\ 0) = - f M'(s)c(s,Y(s))dB(s), 

Jo 

and hence assertion 1 follows, because stochastic integrals with respect to Brow¬ 
nian motion are local martingales. Next we choose a sequence of stopping times 
r n which increase to go P-almost surely, and which are such that the processes 
t I—> M' (t a r n ) are genuine martingales. Then we see E [ M' (t a r n )] = 1 for 
all n e N and t ^ 0. Fix t 2 > t\. Since the processes t >—> M' (t a r n ), n e N, 
are P-martingales, we see that 

E [M ; (t 2 a r n ) | fhj = M' (t\ a r n ) P-almost surely. (4.113) 

In (4.113) we let n —> go, and apply Scheffe’s theorem to conclude that 

E [M' ( t 2 ) | T tl ] = M' (fii) P-almost surely. (4.114) 

The equality in (4.114) shows that the process t >—► M'(t) is a P-martingale 
provided that E [ M'(t )] = 1 for all t ^ 0. This completes the proof of assertion 
1. 2 This assertion follows from the following calculation: 

Z\t) = - £c( S ,T(s)) dB(s) - ± Jjc( s ,r(s))| 2 ds 

= - £ dB'(s) + g f l c (s,^( s ))| 2 ds 

(c{s, y) = c 1 (s,y)a(s,y)) 

= -J ci (s,T(s))cr(s,y(s)) dS'(s) + ^J |c(s,T(s))| 2 ds 

= - £ Cl (s, Y(s)) d (£ a (r, Y(r)) dB\r £ + \ £ |c (s, K(s))| 2 ds 

= - £ Cl (s, Y(s)) d (y(s) - £ b (r, Y (r)) dr^j + ± £ |c (s, Y (s))| 2 ds. 

(4.115) 

From (4.115), (4.110), and (4.112) it is plain that M'(t ) only depends on the 
path (y(s) : 0 < s ^ t}. 

3 This assertion is a consequence of Kolmogorov’s extension theorem. The 
measure is P' is well defined on u t >oTt. Here we use the martingale property. 
By Kolmogorov’s extension theorem, it extends to the cr-field generated by this 
union. 
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4 The equality B'(t ) = B(t) + ^c(s,T(s)) ds entails the following equality 
for the quadratic covariation of the processes B[ and B'-\ 

(4.116) 

From Ito calculus we also infer 


M\t)W) 

= I M’(s)B' i (s)dZ , (s) + f M'(s)dB'(s) 

Jo Jo 

+ \ f M’(s)B’(s) d <Z', Z') (a) + f M'(s)d (Z' t B\) (s) 

- - f M'(s)B' i (s)c (. a,Y(a )) AB(s) - i f M’(a)B’(s) \c (a, Y(s))\ 2 ds 

+ 5 f M'(s)B[(s)\c(s,Y(s))\ 2 ds+ f M'(a)dB,(s) 

* Jo Jo 

+ f M'(s)d(s,Y(s)) ds - f M'(s)g(s,Y(s)) ds 

Jo Jo 

= - f M'(s)5;(s)c(s,y(s)) d5(s)+ f M , (s)dB i (s). (4.117) 

Jo Jo 

Upon invoking Theorem 4.5 and employing (4.116) and (4.117) assertion 4 fol¬ 
lows. 


This concludes the proof of Proposition 4.24. 


□ 
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Let the process X(t) solve the equation in (4.108), and put 

M(t) = exp c (s, X(s)) dB(s) — - J \c (s, X(s ))| 2 ds^j , (4.118) 

and assume that the process M(t) is not merely a local martingale, but a genuine 
P-martingale. 

4.25. Theorem. Fix T > 0, and let the functions 

b(s,y), cr(s,y), c(s,y), andcfis,y), 0 < s T, 

be locally bounded Borel measurable vector or matrix functions such that c(s, y) = 
Ci(s,y)a (s,y), 0 < s < T, y e M. d . Suppose that the equation in (4.108) pos¬ 
sesses unique weak solutions on the interval [ 0 .T], 

Uniqueness. If weak solutions to the stochastic differential equation in (4-109) 
exist, then they are unique in the sense as explained next. In fact, let the couple 
(Y (s), B(s)), 0 ^ s < t, be a solution to the equation in (4-109) with the property 
that the local martingale M'(t ) given by 

M'(t) = exp j c (s, T(s)) dB(s) — ~ J \c (s, Y(s ))| 2 ds^j . (4.119) 

satisfies E [M'(tj] = 1. Then the finite-dimensional distributions of the process 
T(s), 0 < s < t, are given by the Girsanov or Cameron-Martin transform: 

E [/ (T (G) ,...,¥ (* n ))] = E [M(t)f (X (G) ,..., X (t n ))] , (4.120) 

t ^ t n > ■ ■ • > G ^ 0, where f : M. d x • • • x M. d —> M is an arbitrary bounded 
Borel measurable function. 


Existence. Conversely, let the process s > (X(s), B(s)) be a solution to the 
equation in (4-108). Suppose that the local martingale s > M(s), defined by 

M(s) = exp c{T,X{r))dB{r) - ^ J \c(r, X(t))\ 2 dr^j , 0 < s < t, 

(4.121) 

is a martingale, i.e. E[M(t)] = 1. Then there exists a couple ^T(s),5(s)^ ; 
0 < s < t, where s > B(s), 0 ^ s < t, is a Brownian motion on a probability 
space ( Q, 3 \ P) such that 


Y(s) = x + 


+ 


a (t,T(t)) dB(r) + 

[fir.Yd 


a (r, Y (r)) c (t, Y(t) J dr 


t) dr, 


and such that 


E 


exp 


-f 


c(s,y(s)) dB(s) -c(s,y(s 


-if 

2 Jo 


ds 


(4.122) 


= 1 . (4.123) 
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4.26. Remark. The formula in (4.120) is known as the Girsanov transform 
or Cameron-Martin transform of the measure P. It is a martingale measure. 
Suppose that the process t ► M 7 (t), as defined in (4.110) is a P-martingale. 
Then the proof of Theorem 4.25 shows that the process t •—> M(£), as defined 
in (4.118) is a P-martingale. By assertion 1 in Proposition 4.24 the process 
t i-> M'(t) is a P-martingale if and only E = 1 for all T ^ t ^ 0, and a 

similar statement holds for the process 1 1 -> M(t). If the process t > M\t) is a 
martingale, then taking G m 1 in (4.135) shows that E [M(t)] = 1, and hence 
by 1 in Proposition 4.24 the process 1 1 -> M(t) is a P-martingale. Conversely, if 
the process t ► M(t) is a P-martingale, then we reverse the implications in the 
proof of Theorem 4.25 and take F m 1 in (4.139) to conclude that E \M'(t)\ = 1 
for all ^ 0. But then the process 1 1 —► M\t) is a P-martingale. 


Notice that the process t >- 
dition is satisfied, i.e. if E 


M(t) is a P-martingale provided Novikov’s con- 


exp 


ip 

2 Jo 


c (s,X(s))\ 2 ds 


< go. For a precise 


formulation see Corollary 4.27 below. Define 


£(M)(f) = e M W-i< M ’ M >W. (4.124) 


4.27. Corollary. If supE 

o 


E 


exp ( - (M, M) {t) 


exp ( M( go) — - (M, M) (go) 


< go ; then 

= 1 , 


and consequently the process t > E(M)(t) is a P -martingale relative to the 
filtration where = cr(M(s) : 0 < s ^ t), the a-field generated by the 

variables M(s), 0 ^ s ^ t. 


Novikov’s result is a consequence of results in [76]; see Chapter 1 of [146]. 
Observe that M(oo) = lim t ^oo M(t) exists P-almost surely. 


4.28. Remark. Let s > c(s ) be a process which is adapted to Brownian 
motion in Wf and let p > 0 be such that Novikov’s condition is satisfied: 


E 


exp 


(|p 2 £l c ( s )l 2rfs ) 


< oo. From assertion 4 in Proposition 4.24 and The¬ 


orem 4.25 we see that the following identity holds for all bounded Borel mea¬ 
surable functions F defined on (M^)™: 


E [F (V (ti) ,... ,Y p (t n ))] 


= E 


exp 




F(B B(Q) 


(4.125) 


where 0 ^ t\ < ■ ■ ■ < t n < t, and Y p {r ) = B(t ) + c(s ) ds, 0 < r ^ t. In 
particular, if n = 1 we get 


E 


F ( B[t) + p c(s ) ds 


f 
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E 


exp^pj c(s)dB(s) - ^p 2 J |c(s)| 2 cfej F (B (t)) 


(4.126) 


Assume that the gradient DF of the function F exists and is bounded. The 
equality in (4.126) can be differentiated with respect to p to obtain: 


E 


E 


B{t) + p J c(s) dfij , J* c(s) ds 

3 M dB ^ - \ p T |c(s)| 2 ds 

J c(s) dB(s) — p J |c(s)| 2 ds^ F (B(t)) 


(4.127) 

The bracket in the left-hand side of (4.127) indicates the inner-product in W l . 
In (4.127) we put p = 0 and we obtain the first order version of the famous 
integration by parts formula: 


E 


DF(B(t )),J c(s)ds ^ =E J c(s)dB(s)F(B(t )) 


(4.128) 


We mention that the Cameron-Martin-Girsanov transformation is a cornerstone 
for the integration by parts formula, which is a central issue in Malliavin cal¬ 
culus. For details on this subject see e.g. Nualart [103, 102], Malliavin [92], 
Sanz-Sole [118], Kusuoka and Stroock [78, 79, 80], Stroock [127], and Norris 

[ 100 ], 
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For a proof of Theorem 4.25 we will need the Skorohod-Dudley-Wichura repre¬ 
sentation theorem: see Theorem 11.7.2 of Dudley [41]. It will be applied with 
S = C ([0, t], M d ) and can be formulated as follows. 

4.29. Theorem. Let ( S,d ) be a complete separable metric space (i.e. a Polish 
space), and let P&, fceN, and P be probability measures on the Borel field B$ of 
S such that the weak limit w— lim^oo P& = P, i. e. lim^oo $ FdPk = $ FdP for 
all bounded continuous functions of F e Cb(S). Then there exist a probability 

space ^0,7, P^ and S-valued stochastic variables Y^, feN, and Y, defined on 
D with the following properties: 


(1) P k [B] = P 


Y, e B 


k eN, and P [B] = P 


Y e B 


B e B, 


(2) The sequence Y\., k e N, converges to Y P -almost surely. 

4.30. Remark. An analysis of the existence part of the proof of Theorem 4.25 
shows that the invertibility of the matrix a(s,y) is not needed. Let N(s), 

0 < s < t, be a local martingale on a filtered probability space (12,7 S ,P), 


where the a-field 7 S is generated by (Y (r) : 0 < r < s ). Suppose that the 
covariation process of N(s) is given by 




-f 


a ( t, Y(t 


dr , 1 < ji, j 2 < d. 


r, Y (r) 

JO V V 7 V / / J1,J2 

Here Y is a local martingale on P^. Then by assertion (iii) in Theorem 

4.17 there exists a Brownian motion £>(s), 0 ^ s < on this space such that 


J Cl (t,Y(t)^ dN (r) = J Cl (r,T(r)) a (t,T(t)) dB{r) 


= J c(t,Y(t)) dB{r). 


(4.129) 


Proof of Theorem 4.25. Uniqueness. Let the process 7(s), 0 < s ^ t, 
be a solution to equation (4.109). So that 

Y(s)=x + f cr (t, Y(t)) dB{r) + f {b{r,Y{r)) + a {t,Y{t))c{t,Y{t))) dr 

Jo Jo 

= x + f a{r,Y{r))dB 

Jo 

Let F ((y(s)) 0 ^ ssgt ) be a bounded stochastic variable which depends on the 
path y(s), 0 < s < t. As observed in 4 of Proposition 4.24 the process B\t) 
is a P'-Brownian motion, provided E [M'(t )] = 1. Uniqueness of weak solutions 
to equation (4.108) implies that the P'-distribution of the process s >—> Y(s ), 
0 < s < t, coincides with the P-distribution of the process s >—► X(s), 0 < s < t. 
In other words we have 

e' [f ((r( s )) oa , sl )] 


I 


'(r) + b (r, Y(t)) dr. 


(4.130) 
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- E 


exp 


[ Ci(»,n»)) ‘Wps) - \ J k(», Y (s))| 2 ds) F «y(«))«,«) 

(4.131) 


= E [F((X(s)) om )], 
where 

N y (s) = Y(s)— f cr (t,Y(t)) c(t,Y (t)) dr — f 6 (r, Y(r))dr 

Jo Jo 

= f o- (t,Y(t)) dB(r). (4.132) 

Jo 

With 

g ((rWWJ 

- f c,(s,y(s)) dN y (s) - if |c (s, r (s)) I 2 (is) F ((y (s)) 0 «., sl ) 


fdls.ns)) dN Y (s) + [ i|c(s,y(s))| 2 ds) G((y(s)) os ,J 


= exp 
we have 

F ((*») „««,) 

= exp 
So, since 

dN x (s) = dX(s) — a (s,X(s)) c (s,X(s)) ds — b (s,X(s)) ds 
= a (s, X(s)) (d£(s) - c (s, X(s)) ds) 

it follows that 

F ((*«W) 

= exp 

= exp 


(4.133) 


£ C! (s, A'(s)) diV A ' (s) + i Jjc (s, X(s ))| 2 ds) G «A(s)) 0< , s( ) 
(fc(s, A(s)) dB(s) - i Jjc (s, X(s ))| 2 ds) G ((X(s)) 0fsft ) 


(4.134) 


From (4.131) and (4.134) we infer: 

E[G((n*))o<.J] 


(4.135) 


E 


exp 


J q c(s,X(s)) ds- 1 \c(s,X(s))\ 2 ds \ G ((X(s)) 0M ) 


By inserting G = 1 in (4.135) we see that 


E 


exp 


fc( S ,X(s)) ds-lJjc(s,X(s))| S 


ds 


= 1 


in case there is a unique solution to the equation in (4.122). This proves the 
uniqueness part of Theorem 4.25. 
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Existence. Therefore we will approximate the solution Y by a sequence Y kl 
he N, which are solutions to equations of the form: 

Y k (s)=x + f a(T,Y k (r)) dB(r) 

Jo 

+ f (b (r, Y k (r)) + a (r, Y k {f)) c k (t, Y k (r))) dr 

Jo 

= x+ f a (r, Y k (r)) dB' k (r) + f b (r,Y k (r)) dr. (4.136) 

Jo Jo 

Here 

B k (s) = B k (s ) + f (r,Yfc(r)) dr, 

Jo 

and the coefficients c k (s,y ) = Ci :k (s,y)a (s,y) are chosen in such a way that 
they are bounded and that c(s, y) = lim^oo c k (s, y) for all s e [ 0 , t] and y e M d . 
By Novikov’s theorem the corresponding local martingales M' k , given by 

M' k {s) = exp J c k (T,Y k (r)) dB(r) - ^ J \c k (r,Y k (r))j 2 dr^j , k e N, 

are then automatically genuine martingales: see Corollary 4.27. From the 
uniqueness of weak solutions to equations in X(t) of the form (4.108) (and 
thus to equations in Y k (s) of the form (4.136) we infer 

K F ((UM )„««)] =E[F ((X(<0)o««)]. (4,137) 

In equality (4.137) the process Y k (s ), 0 F s F t, solves the equation in (4.136). 
The equality in (4.137) can be rewritten as 

E [M' k (t)F ((Y k («))„„«)] =E[F (FM)o s ,«)]. (4.138) 


By (4.115) the equality in (4.138) can be rewritten as 


E 


exp 


= E 


exp 


J c t (s, n W) dB(s) -1J I c k (s, U(s))| 2 ds) F ((Y„(s)) owt ) 

\ c 1J ,(s,Y k (s)) d (y»(s) - [b(T,Y k (T)) dT 
Jo V Jo 


+ 2 


\ J I c k ( s ,u( S ))i 2 ds) f ((nw) 0< .«) 


- E [F (FM)o,«,)] • 


(4.139) 


Let G ((U( s ))o< s <t) be a (bounded) stochastic variable which depends on the 
path Y k (s), 0 ^ s < t. From the equality in (4.139) we infer 


e [g «nM) 0< .«)] 


E 


J c I}k (s,X(s))dfx(s)-J b(r,X (r)) dr 
£ F F*M)I 2 ds) a ((X(s)) 0fsft ) 
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= E 


exp 


f C 1)fc (s, X(s)) a (s, X(s)) dB(s ) - J f |c fc (s, X(s))| 2 ds 
Jo * Jo 


X G ((*(»))„<.«) 

-E[M,(t)G((X( s )) os , st )]. (4.140) 

Here the martingales Mk(s) are given by 

M fc (s) = exp^J c fc (r,X(r))dB(r) - i J |c fc (r, X(r))\ 2 dr^j , ke N, 

This fact together with the pointwise convergence of M^(s) to M(s), as k —> go, 
and invoking the hypothesis that E [M(t)] = 1, shows that the right-hand side 
of (4.140) converges to E [ M{t)G ((X(s)) 0sgssgt )]. In other words the distribu¬ 
tion P 5 ^ of Yk converges weakly to the measure P M,X defined by F M,X (A) = 
E [M(t), X e A], where A is a Borel subset of the space C ([0,t],M d ). By the 
Skorohod-Dudley-Wichura representation theorem (Theorem 4.29) there exist 

a probability space ^12, T, and C ([0, t], M d )-valued stochastic variables Yk, 
k eN, and Y, defined on 12 with the following properties: 


(1) P y *[5] = P 


Be 3 


Yk e B 


, k e N, and F M ’ X [B] = 


YeB 


, for all 


C([0,t],M d ) • 


(2) The sequence (1).) converges to Y P-almost surely. 

' ke N 
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By taking the limit in (4.140) for k —► go and using the theorem of Skorohod- 
Dudley-Wichura we obtain 


E 


G\[Y(s 




= E[M(t)G((X(s)) 05s5t )] (4.141) 


where G is a bounded continuous function on C (| U i | F"' ; ) Then we consider 
the process N(s), 0 C s ■■ / defined by 


N(s) 


=- r 

Jo 


-f 


a It, Y(r)\ c ( t, Y (t) ) dr — b It, Y(r) J dr. (4.142) 


If y(s) were Y(s), then by (4.130) N(s) would be N Y (s), given by the formula 
in (4.132). Hence the process s >—► N Y (s), s e [0,f], is a stochastic integral 
relative to Brownian motion on the space (f2,3q,P). We want to do same for 

the process s >—> N(s), 0 < s < t, on the probability space (f2,5F, P^. Let 

pM(t) | )e |-} ie probability measure on (Q. 3y) defined by P M 0) [T] = E [M(t), A], 
A e Jq. Then like in item (4) of Proposition 4.24 we see that the process s >—» 
B(s ) — ^ct(t, X(t)) dr is a P M ^-Brownian motion. In addition, from (4.141) 
and (4.142) we infer that the P-distribution of the process N(s), 0 ^ s ^ t, is 
given by the P Mfi, -distribution of the process 

s i-*-X(s) — f a (t, X(t)) c (t, X (t)) dr — f b(r,X(r)) dr 

Jo Jo 


= f <t(t,X(t)) (dB(r) - c(t,Y (t)) dr) 

Jo 


0 

= \\{t,X(t)) dP M(t) (r), 

Jo 


(4.143) 


where B M ^\s) is a P M (^-Brownian motion: see Proposition 4.24 item (4). It 
also follows that the process in (4.143) has covariation process given by the 
square matrix process 


f <t(t,X(t))<j* (t,X(t)) dr, 0 

Jo 


^ s < t. 


Consequently, the process s N(s), 0 < s < t, is a local P-martingale with 
covariation process given by 


s h->• J a (r, Y(r )) a* (r, Y(r )) dr, 0 ^ s ^ t. 


(4.144) 


o 

In order to prove (4.144) we must show that the process 

d 

s %( s )%( s ) - 2 J o °h,k (t,Y(t)^ a h:k (r,Y(r)j dr 

is a local P-martingale. The latter can be achieved by appealing to the fact the 
P-distribution of the process s ► T(s), 0 ^ s ^ t, coincides with the P M W_ 
distribution of the process Then we choose a Brownian 

motion £>(s), possibly on an extension of the probability space 5F, P^, which 
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we call again ^12, T, P^ such that N(s) = $*er (r, Y(t)^J dB{r). For details see 

the proof of the implication (ii) => (iii) of Theorem 4.17. With such a Brownian 
motion we obtain: 


Y(s) = x + (T (t,Y(t)j dB{r 


1 

r 


+ a (r, Y (r)) c (r, Y (r)) dr + 




r)) dr. (4.145) 


Since 


E 


exp 


-f 


c(s,y( s )) dB(s) -c(s,y( s 


-if 

2 Jo 


ds 


= 1 (4.146) 


it follows that the process s >—> B(s) + $gC \ t,Y(t) J dr is a Brownian motion 
relative to the measure 




exp 


-J c(s,y(s)) (TB(s)-^J |c(s,y(s 


ds ) , A 


, A e 3 r . 


The equalities in (4.145) and (4.146) complete the proof of Theorem 4.25. □ 


3.1. Equations with unique strong solutions possess unique weak 
solutions. The following theorem shows that stochastic differential equations 
with unique pathwise solutions also have unique weak solutions. Its proofs puts 
the Levy’s characterization of Brownian motion at work: see Theorem 4.5. 

4.31. Theorem. Let the vector and matrix functions b(s,x ) and a(s,x ) be as 
in Theorem J h 25. Fix x e M. d . Suppose that the stochastic (integral) equation 

a (s, X(s)) dB(s) + f b(s,X(s)) ds (4. 

Jo 

possesses unique pathwise solutions. Then this equation has unique weak solu¬ 
tions. 

In the proof we employ a certain coupling argument. In fact weak solutions to 
the equations in (4.3) and (4.4) are recast as two pathwise solutions of the same 
form as (4.147) on the same probability space. 

Proof. Let {(B(t) : t A 0),(Q,T,P)} and {(B'(t) : t A 0), (fi', 3*, F)} be 
two independent Brownian motions. Without loss of generality it is assumed 
that, for 0 < t < go, 

= a B(s ) : O^s^fj^JlTleT 0 : P [T] = 0}^ , and 

= a ({ B\s ) : 0 ^ s ^ t} |J {A! e T' : P [A'] = 0}) . 

Moreover, T = a (Ut>o^)> ari( l a a similar assumption is made for Tk Let 
{X(t) : t ^ 0} be an adapted process which satisfies (4.3), and let {X'(t) : t ^ 0} 
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be an adapted process which satisfies (4.4). Suppose 0 ^ t\ < t 2 < ■ ■ ■ < t n < go, 
and let C\,..., C n be Borel subsets of W l . We have to prove the equality: 

P' [X' fa) e Ci,... ,X' (t n ) eC n ]= P [X (tt) eCi,...,X (t n ) e C n ]. (4.148) 

Let (f2 0 > T 0 , Po) be a probability space with a Brownian motion {B 0 (t) : t ^ 0} 
such that lb 0 = a ({ B 0 (t ) : t ^ 0} (J {A 0 6 lb 0 : P 0 [A 0 ] = 0}). Define the Re¬ 
valued processes Y(t), Y'(t), and B 0 (t) on D x O' x D 0 as follows: 

Y(t) (u,u',Uo) = X (t) (u;), (u,u',ojo) e ft x ft' x f2 0 > 

< Y'(t ) (uj, uj', luq) = X'(t) ( uj'), (uj, uj' , u>o) e fi x O' x f2 0 > (4.149) 

B 0 (t) (uj,uj' ,ujq) = Bo(t ) (tuo), (uj,uj' ,ujq) e ft x ft' x fi 0 . 

In fact we use the notation O 0 instead of ft to distinguish the third component 

of the space fi x fi' x O 0 from the first. The role of the first two components 
are very similar; the third component is related to the driving Brownian motion 
(Bo(t) : t ^ 0}. The processes Y(t) and Y'(t) are going to be the pathwise 

solutions on the same probability space ^0 x ft' x fi 0 , T® 3" 0 Q x ^ : see 

(4.159) and (4.160) below. On f2 0 the probability measure P 0 is determined by 
prescribing its finite-dimensional distributions via the equality: 

Po [{Bo (ti ), • • •, Bo ( t n )) 6 D] = P [(B (ti ),..., B (; t n )) e D] 

= r[(B'(t 1 ),...,B'(t n ))eD], (4.150) 



qaiteye 
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In (4.150) we have 0 ^ t\ < ■ ■ ■ < t n < oo, and D is a Borel subset of (M d ) n . 
Let C be another Borel subset of (M d ) . On Q x Q 0 and O' x Oo the probability 
measures Q x and Q' x are determined by, respectively, the equalities: 

Qx [(-X" (Q) j • • • ? X (t n )) e C , (f?o (t \),..., f?o ( t n )) e D] 

= P[(X(f 1 ),...,X(t n ))eC, (B(ti),...,B(g)6D], and 
Q' [(X' (H),..., X' (t n )) e C, (B 0 (h) (t n )) e D] 

= P' [(X' (B ),..., X' 0 t n )) e C, {B> (h) (t n )) e D ]. (4.151) 


Notice that P 0 [^4 0 ] = 0 implies (Qh [0 x A 0 ] = Q x [O' x H 0 ] = 0. Consequently, 
by the Radon-Nikodym’s theorem there are (measurable) functions 

Q x : 2 x O 0 — [0,1], and Q’ x : 2' x O 0 — [0,1] 


such that, respectively, 

Qa; [hi x ^4 0 ] = | Qx (A, n?o) dPo (cjo) , A g IP, Aq e 3^) and 
Ja 0 

Q' x [A 1 xA 0 ]= ( Q’ x (A, w 0 ) dP 0 (w 0 ), A! e X, Ao e 2°. (4.152) 

JA 0 

Here Q x ( O,w 0 ) = <5 x (O',w 0 ) = 1 for P 0 -almost all w 0 e O 0 . Moreover, the 
functions 

wo l—> Qx (A, Wo), and wo > Q' x (A, wo) (4.153) 

are measurable relative to the P 0 -completion of T°. In addition, the set functions 
A <-*■ Q x (A,luo), A e 2, and A’ ^ Q' x (A',cu o), A' e 2' are Po-almost surely 
probability measures. Here we use the fact that, except for negligible sets, the 
cr-fields 2 and 2' are countably determined. Finally, we define the measure 

Qa, : IF 0 2' 0 2° —> [0,1] via the equality 
Qa, [A x A! x H 0 ] = Q x (hi, w 0 ) Q' x (A', w 0 ) 1 a 0 (wo) hPo (w 0 ) 

= Eo [wo ► Q x (A, w 0 ) Q' x (A 1 , wo) 1 a 0 (wo)] • (4.154) 

Here A, A', and A () belong to 2, 2', and 2° respectively. First we prove that 
the process ^B 0 (t) : t ^ 0 j is Brownian motion with respect to the measure Q x . 

The corresponding expectation is written as E x . From the proof of Theorem 
4.5 (i.e., Levy’s characterization of Brownian motion) it follows that it suffices 
to show that the following equality holds: 

Ea; [exp (-z (%, B 0 (t ) - B 0 (s)^ | 2 S 0 2' s 0 3\[ 

= exp - |£| 2 (t — s)^j , t > s ^ 0, £ e (4.155) 

By definition 2 S = a (B(p) : 0 C p N s). Similar definitions are employed for 
3%, and for the cr-field 9^. In order to prove (4.155) we pick A e 2 S , A' e T', 
and A 0 e 3^. Then by (4.154) we get 


E, 


exp f-z U,B 0 (t) 



1 


AxA'xAq 
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= exp \-iU, B 0 (t) —.Bo(s))) dQ x 

JAxA'xAo V X /y 

= E 0 [w 0 >-► exp (—i (f, £ 0 (0 (w 0 ) - 5 0 (s) (w 0 ))) 

x Qx (A wo) Q' x (A 1 ,u> o) l^o (wo)] • (4.156) 


The process (oj 0 ,t) >—> B 0 (t ) (w 0 ) is a Brownian motion relative to P 0 , and the 
events A, A', and A 0 belong to T s , and A? respectively, and hence the 
variable B 0 (t) — B 0 (s ) is P 0 -independent of the variable 

w 0 | — ► (Awo) (A', wo) lAo (wo) • 

Therefore (4.156) implies 


E T 


exp h-i (£, B 0 (t ) - £ 0 (s)yJ lixA'xAo 
f (A w 0 ) (3 X (A, w 0 ) dP 0 (w 0 ) x f exp (-* (£, B 0 (t) - B 0 (s))) 

JAn J 


dPn 


[Ax A x A 0 ] exp ( ~ |£| 2 (t - s) ) . 


(4.157) 


The equality in (4.155) is a consequence of (4.157). Since, by definition (see 
(4.150)) 

P 0 [(Bo (B) ,...,B 0 (t n )) e C] = P [( B (B) ,...,B (t n )) e C] (4.158) 

for 0 ^ t\ < ■ ■ ■ < t n < oo, C Borel subset of (R d ) , and since the process 
{B(t) : t ^ 0} is a Brownian motion relative to P, the same is true for the 
process {B 0 (t) : t ^ 0} relative to Pq. Next we compute the quantity: 


E T 


Y(t) — x — f a (s,Y(s)) dBo(s) — f b(s,Y(s))ds 

Jo Jo 

f X(t) — x— f a (s,X(s)) dB(s) — f b(s,X(s))ds 

J Jo Jo 


dP = 0. (4.159) 


Similarly we have 


Y\t) -x- 


f <7(s,y'(s)) dB 0 (s)- f b(s,Y'(s))ds 

Jo Jo 

= f X'(t)-x - f a(s,X'(s)) dB'(s) — f b(s,X(s))ds 

J Jo Jo 


dP' = 0. (4.160) 


From (4.159) and (4.160) we infer that the following equalities hold Q x -almost 
surely: 

i(s) + f b(s,Y(s))ds and (4.161) 

Jo 

o(s) + f b(s,Y'(s)) ds. (4.162) 

Jo 

Moreover, the process j-E> 0 (t) : t > 0 j is a Brownian motion relative to Q x . 
From the pathwise uniqueness and the equalities (4.161) and (4.162) we see 


V(/) = x + 
Y\t) = x + 


f °(s,Y(s)) 

Jo 

Jo 


dB(] 

dR 
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that, Q x -almost surely, 

Y(t) = Y'{t ), t > 0. (4.163) 

Let 0 < 0 < fi < • • • < t n < oo, and let C be a Borel subset of (R d ) n - From 
(4.163) it follows that 

4 [(Y (t x ) ,...,¥ 4)) e C] = 4 [(¥' 4) 4)) e C ]. (4.164) 

Using (4.164) and the definition of the measure Q x shows that the following 
identities are self-explanatory: 

4 i(Y 4) ,...,¥ (4) eC] = Q, [(X 4),..., X (t n )) e C, fi 0 ] 

= P [(X 4),..., X 4)) eC,Q]= P [(X 4),..., X 4)) e C ]. (4.165) 

The definition of the measure 0,„ is given in (4.154). Similarly we conclude 

4 [(Y‘ 4),..., Y' (t n )) eC]=F [(X' 4),..., X' (t n )) e C ]. (4.166) 

From (4.165), (4.166), and (4.164) we obtain 

P [(X 4),..., X 4)) e C] = P [(X' 4),..., X' 4)) e C ]. (4.167) 

The equality in (4.167) implies that the finite-dimensional distributions of the 
solution in equation in (4.3) are the same as those of the solution of equation 
(4.4). So that stochastic differential equations with unique pathwise solutions 
also possess unique weak (or distributional) solutions. 

This concludes the proof of Theorem 4.31. □ 

4.32. Example (Tanaka’s example). Let the process t >—► B(t), t 5= 0, be one¬ 
dimensional Browmian motion on the probability space (f2,T, P), and let the 
continuous process t i—> X(f) be such that X(f) = $*sgn(X(s)) dB(s). Here 

y 

sgn(y) = -j— j- for y / 0, and sgn(y) = 0, when y = 0. It can be proved that such 

a process exists. If t >—> X (t) solves this equation, then the process t —X(t) 
is a solution as well. So we see that the equation dX(t ) = sgn(X(f)) dB(t), 
X(0) = 0, does not have pathwise unique solutions. On the other hand the 
process t i—> X(f) is (local) martingale, and, since B(t) = J*sgn(X(s)) dX(s), 
we get 

t = (B , B) 4 = f |sgn (X(s))| 2 d (X, X) (s) = (X, X) (t). 

Jo 

Hence, (X, X) (t) = t. Levy’s martingale characterization of Brownian motion 
(see Corollary 4.7 and Theorem 4.5) then implies that the process t >—► X(t) 
is a Brownian motion on (U, T, P). So that the distribution of X(t) is that 
of Brownian motion. Consequently, the equation dX(t ) = sgn(X(t)) dB(t) has 
unique weak solutions. For more details on Tanaka’s example and its connection 
with local time see, e.g ., 0ksendal [ 106 ]. 

Conclusion. In this chapter we treated several aspects of the theory of stochas¬ 
tic differential equations: strong and weak solutions, Levy’s characterization 
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of Brownian motion, exponential martingales, Hermite polynomials with appli¬ 
cations to exponential martingales, a version of the martingale representation 
theorem, and the Girsanov or the Cameron-Martin-Girsanov transformation. 


This e-book 
is made with 

SetaPDF 


QO 



SETASIGN 




PDF components for PHP developers 


www.setasign.com 


293 




Click on the ad to read more 


Download free eBooks at bookboon.com 





Download free eBooks at bookboon.com 



Advanced stochastic processes: Part II 


Some related results 


CHAPTER 5 

Some related results 


In this section we will discuss, among other things, Fourier transforms of dis¬ 
tributions of random variables, positive-definite functions, Bochner’s theorem, 
Levy’s continuity theorem, weak convergence of measures, ergodic theorems, 
projective limits of distributions, Markov processes with one initial probability 
measure, Doob-Meyer decomposition theorem based on Komlos’ theorem. 

1. Fourier transforms 

Since we will also need signed measures, we will discuss them first. 

1.1. Signed measures. Let M = M(R",C) be the vector space of all 
complex Borel measures on M !/ . and let M + be the convex cone of all positive 
finite Borel measures op R". Then we have M = M + —M+-M (M+ — M+). Thus, 
every complex Borel measure /i on R 1 ' can be written as /i = /ii — p 2 Ai (/Mi — /i 4 ), 
where /ii, /i 2 , p 3 and /i 4 are finite positive Borel measures. In fact the measures 
fij, 1 ^ j ^ 4, can be chosen in the following manner: 

Pi(B) = sup {Re p(C ) : C ^ B, C Borel }; 

P 2 (B) = sup {—Re fi(C) : C <= B, C Borel } ; 
p 3 ( B ) = sup {Im /j,(C) : C <= B, C Borel }; 

/ 14 (B) = sup {—Im /i(C) : C <= B, C Borel }. 

For this choice of the measures fii, /i 2 , 7/3 and /i 4 , the measures ji\ and // 2 and 
also the measures fi 3 and /i 4 are mutually singular in the sense that for certain 
Borel subsets Bi and B 3 the following equalities hold: 

/ Ui(B) = R e fi(B n B 3 ), / 12 (B) = Re /i (B n B {); 

/ 13 (B) = Re /i ( B n B 3 ), / 14 (B) = Re /i(B n B^). 

This decomposition is known under the name Hahn decomposition. In addition, 
we introduce the variation of a complex measure /i. This measure is denoted as 
\/i\. It is the bounded positive measure defined by 

\/i\ (A) = sup |/x(t4j)| : A 3 Aj and Aj n A^ = 0 for k =|= j | . (5.1) 

Here A is a Borel subset of R 1 " and the same is true for the elements of the 
partition Aj, j e N. The norm ||//|| of the complex Borel measure ft, is then 
defined by the equality: ||/r|| = \/i\ (R 1 '). Supplied with this norm M is turned 
into a Banach space. By the Riesz representation theorem the space M can 
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be taken as the topological dual of the space C'o(R I/ ), being the Banach space 
consisting of those complex continuous functions / : HA —► C with the prop¬ 
erty that linx^oo f(x) = 0 . Then C'o(IHT) is a closed subspace of the space 
(7b (HA), the space of all bounded continuous functions on BA, which is a Ba¬ 
nach space relative to the supremum-norm IHI^, given by ||/ = sup x6H „ |/(x)|, 

/ c r,,(»"). 

5.1. Definition. A complex Radon measure on a locally compact space E is 
a complex Borel measure with the property that for every e > 0 and every 
Borel subset B there exists a compact subset I\ a B with the property that 
\n(B\K)\<e. 

5 . 2 . Theorem (Riesz). Let E be a locally compact Hausdorff space, which is a- 
compact, and let A : Co (A) —> C be a continuous linear functional. Then there 
exists a unique complex Radon measure p on the Borel field of E such that 
A(/) = $ fd/a, f e Co (A). In addition, ||A|| = ||p|| = |p| ( E ). If A is positive 
in the sense that f ^ 0 implies A(/) ^ 0, then the corresponding measure /a is 
positive as well and ||A|| = p(A). 


PROOF. For a proof the reader is referred to the literature. In fact the fol¬ 
lowing construction can be used. Let the measures pj, 1 A j A 4 be determined 
by 

pi(0) = sup {Re A(/) : 0 < / < 1 0 , / e C 0 (A)} ; 

p 2 (0) = sup {-Re A(/) : 0 < / < 1 0 , / e C 0 (A)}; 

p 3 (0) = sup {Im A(/) : 0 < / < 1 0 , f e C 0 (E)} ; 

Im(0) = sup {-Im A(/) : 0 < / < 1 0) / e C 0 (A)}, 


where O is any open subset of E. Then it can be shown that, for each 1 A j A 4, 
the set function p.,- extends to a genuine positive Borel measure on E. This 
extension is again called pj. Moreover, 


A (/) 


J / dpi - J / dp 2 + i ^ J / dp 3 - J 


/dp 4 


/ e Co (A). 


For details the reader is referred to, e.p., [ 136 ]. This completes the proof of 
Theorem 5.2. □ 


5.3. Definition. Let p be a complex Borel measure on PA. Then the equality 

p(x) = J exp (—z (x, p)) dp(p), x g IT, 
defines the Fourier transform of the measure p. 

5.4. Proposition. Let p 6 e a complex measure on IT with the property that its 
Fourier transform is identically zero. Then the measure p = 0. 


Proof. Let u be an arbitrary other complex Borel measure on PA with the 
property that \v(x)\ A 1, x e PA. Then the following equality holds: 


J Jl(x)du(x) = J z/(p)dp(p). 
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Hence, 


= sup 

= sup 

= sup 


J f(y)dfi(y) 

J %)d/j(y) 
\ mdv(x) 


:feC b (W)A) 

■ l%)l < 1,2/ e 


: \v(y)\ < 1 ,y 


< 1 


= 0. 


This completes the proof of Proposition 5.4. 


□ 


5.5. Definition. Let <p : R" —> C be a complex valued function. This function 
is called positive-definite if for every n-tuple of complex numbers Ai,..., A n to¬ 
gether with every choice of n vectors ..., n > in R 1 ', the following inequality 
holds: 

2 (« o> - « (t) ) # o, 

j,k=l 

and this for all n e N. 





www.sylvania.com 




We do not reinvent 
the wheel we reinvent 
light. 


Fascinating lighting offers an infinite spectrum of 
possibilities: Innovative technologies and new 
markets provide both opportunities and challenges. 
An environment in which your expertise is in high 
demand. Enjoy the supportive working atmosphere 
within our global group and benefit from international 
career paths. Implement sustainable ideas in close 
cooperation with other specialists and contribute to 
influencing our future. Come and join us in reinventing 
light every day. 


OSRAM 

SYLVAN I A 


Light is OSRAM 


297 



Download free eBooks at bookboon.com 














Advanced stochastic processes: Part II 


Some related results 


5.6. Proposition. Let p be a complex Borel measure on W with Fourier trans¬ 
form fi. Then the following assertions are true: 

(a) the following inequality holds: \p(x)\ ^ ||/r||. 

(b) If p is positive, then the equalities p(M") = p{ 0) = \\p\\ are valid. 

(c) If p, is positive, then the function p is positive-definite. 

(d) The function fi is uniformly continuous. 

Proof. The proof is left as an exercise to the reader. □ 

5.7. Definition. Define for /i and u measures in M, the convolution-product 
p * u via the equalities: 

p * u(B) = J J l B (x + y)dp(x)dv(y) 

= p®v (S' -1 !?) = p ® v {(x, y) <e ML : x + y e B} . 

Here B is a Borel subset of E^ and S is the (sum) mapping S : (x, y) >—> x + y. 
Let x e E". Define thee Dirac-measure S x by 5 X (B ) = 1 b(x), B Borel subset 
of E !/ . Instead of do it is more customarily to write S. Let p e M. Then p is 
defined by p(B) = p(—B ), where B is a Borel subset of R". Let / : E" —* C be 
a complex function, which is defined on all of E !/ . The function / is given by 
f(x) = f(-x), X e E 17 . 

5.8. Definition. A complex Banach algebra ( A , ||-||) is a complex Banach space, 
endowed with a product which is compatible with the norm. The latter means 
that the product (a, b ) i—> ab , a, b in A, which is a bilinear operation, is contin¬ 
uous in both variables simultaneously. In fact it is assumed that ||a6|| < ||a|| ||6|| 
for all a and b in A. 

Examples of Banach algebras are the vector spaces C'o(E 1 ') and C7(E"), equipped 
with the supremum-norm and the pointwise multiplication. Let B(X) be the 
vector space of all continuous linear operators on the Banach space X, supplied 
with the operator norm and the composition as product. Then B(X) is a non- 
commutative Banach algebra. The following theorem says that M, supplied 
with the convolution product, constitutes a (complex) commutative Banach 
algebra with identity 5. Recall that M stands for the space of all complex Borel 
measures on ET 

5.9. Theorem. The normed vector space (M, ||-||) supplied with the convolution 
product * is a commutative complex Banach algebra with identity 6. If p and v 
belong to M, then the following equalities hold: 

p + v = p + v, ap = ap, pTV = pV , p = p. 

Here a is a complex number. 

PROOF. The proof is left as an exercise for the reader. □ 

The Banach space L 1 (R 1 ') can be considered as a closed subspace of M(E"). 
This can be done via the following inclusion-mapping: / i—»• p/, f e L 1 
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Here p,f is the complex measure B i— * f(x)dx, B e B = where B is 

the Borel field of R". Let fif = /i/p — /i/, 2 + i (/i/,3 — /i/,4) be the Hahn-Jordan 
decomposition of the measure //,/. Then the following equalities hold: 

|/i/| (B) = f \f(x) \ dx\ /i/,i(H) = f max (Re f(x),0)dx-, 

JB JB 

, 2 (-B) = max (—Re /(x), 0) dx] Hf^(B) = max (Im f(x), 0) dx] 

Jb Jb 

, 4 (B) = max (—Im f(x), 0) dx. 

Jb 


H. 


/i/. 


5.10. Theorem. Let Coo(R i/ ) be the space of all complex continuous functions 
with compact support. Then C'oo(R l/ ) is a dense subspace of L l { R") for the topol¬ 
ogy of convergence in mean. This means that C'oo(R 1 ') is dense in L 1 (M U ) relative 
to the topology generated by the L 1 -norm: H/l^ = $ \f(x)\ dx, f e L 1 (R 1 '). 

PROOF. Let e > 0 and let / ^ 0 belong to L 1 (W). It suffices that there 
exists a function g e Coo(R 1 ') such that § | f{x) — g{x)\ dx ^ e. Since 

n2 n 

f = sup 2 _n [2 n /J = sup 2 ~ n V 

HE N HE N - = 1 

we only need to show that, for every pair of positive integers j and n, with 
1 < j < n2 n , there exists a function u^ n e C'oo(R 1 ') such that 


I 


- U iA X )| dx S 2^- 


(5.2) 


Because assume that the functions u,j. n , 1 < j < ri'2 n , satisfy (5.2). Then we 
write f n = 2 _n [min(n, /)2 n J and choose neNso large that 


0 < J ( f(x ) - fn (x)) dx < ^e. 


Then we have 

f( x ) 


1 


n2 n 

2_n 2 u tA x ) 

3 =1 




JI/( 


n2 n 


x) - fn{x)\dx + 2 n ^ \l {f>j2 -n } (x) - u jtn (x)\dx 

i=i J 


1 n2™ 1 

< i E + 2-"Vl£_ £ . 

2 4-| 2n 

j=i 


(5.3) 


Let A be the is -dimensional Lebesgue measure. The inequality in (5.2) can be 
proved by employing the following identities: 

A (B) = inf (A({7) :U^B,U open } = sup {X(K) :K^B,K compact } 

(5.4) 

together with Tietsche’s theorem, which, among other things, says that with a 
given open subset U and given compact subset K, with K <z U, there exists a 
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function u e C'oo(BT) with the property that Ik < u ^ lu- The equalities in 
(5.4) follow via an argument about Dynkin systems. 

This completes the proof of Theorem 5.10. □ 

5.11. Proposition. Let f belong to L 1 (W). Then 

lim I | f(x + y) — f(x)\ dx = 0. (5.5) 

y^Oj 

Proof. By theorem 5.10 it suffices to prove (5.5) for / e C'ooOR 1 ')- Such 
a function / is uniformly continuous. Let K be the support of the function 
/ e C'oo(IR 1 '). Fix e > 0 and choose 6 > 0 in such a way that 

X(K + B(S)) sup \f(x + y) - f{x)\ < e. 

xeK,yeB(8) 

Here the symbol B(5) stands for B{5) = SB( 1) = {x e : |x| ^ 5}. Then we 
have 

f I fix + y) - f(x)\ dx < e 

for \y\ < S. So the proof of Proposition 5.11 is complete now. □ 

5.12. Theorem (Riemann-Lebesgue). Let f e L 1 (M 1 '). Then lim f(x) = 0. 


Of course here we write f(x) = ^ exp (—i (x, y )) f(y)dy. 
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Proof. By translation invariance of the Lebesgue-measure we get the equal- 


ity: 


f(x) = i Jexp (—i (x, y )) ( f(y) - f ( y + tt 
From (5.6) the inequality: 


t4 ) W 

(5.6) 

\x\ J J 


dy. 

(5.7) 


f(y)~ f [y + n-r-a 

V \x\ 

A combination of (5.7) and Proposition 5.11 yields the desired result, and com¬ 
pletes the proof of Theorem 5.12. □ 


5.13. Theorem (Stone-Weierstrass). Let E be a locally compact Hausdorff space 
and let A be a subalgebra of Cq(E), which separates points of E and which is 
closed under complex conjugation. That is, if f belongs to A, then f also belongs 
to A. Then A is dense in Cq(E). 


Proof. Let E A be the one-point compactihcation (Alexandroff compacti- 
hcation) and A\ = A © Cl = {/ + A 1 : / 6 A, X e C}. Here 1 is the constant 
function with value 1 and functions f e A vanish in A. The theorem of Stone- 
Weierstrass, applied to the compact Hausdorff space E A results in the desired 
result, and completes the proof of Theorem 5.13. □ 

5.14. Theorem. The set j/ : / e Coo (IT) j is a subalgebra of C'o(lC) that is 

closed under taking complex conjugates. This algebra is dense in C'o(BC) with 
the supremum-norm. 


PROOF. The fact that the set A := j/ : / e L 1 (M !/ )| is a subalgebra of 
C'o(M !/ ) follows from the standard properties of the Fourier transform in combi¬ 
nation with Theorem 5.12. Since / = / it also follows that this algebra is closed 
under complex conjugation. In order to apply the Theorem of Stone-Weierstrass 
we still have to show that A separates the points of IT. To this end take x 0 and 
yo A xq e KC. Then there exists a bounded open neighborhood V in W' such 
that exp (—i (xq, y)) — exp (—i{yo,y)) 4= 0 for y e V. Next consider the function 
/ : y i—> (exp (i (xo, y)) — exp {i (y 0 , y)))v(y), where v is a function in Coo (®T) 
with v ^ ly. Then we see 

f( x o) ~ Kv o) = J | ex P (-i (x 0 , y)) - exp (-* (y 0 , y ))| 2 v(y)dy > 0 . (5.8) 

From (5.8) it immediately follows that A separates the points of KA The asser¬ 
tion in Theorem 5.14 now follows from Theorem 5.13. □ 


In the following theorem we collect some properties of positive-definite functions. 

5.15. Theorem. Let Lp : IT —> C be a positive-definite function. Then pos¬ 
sesses the following properties: 

(a) <p(—x) = <p(x), x e IT; 

(b) W(x)\ < ¥>(0), e M 17 ; 
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(c) \tp(x) — ip(y )| 2 ^ 2(/?(0) (y?(0) — Re cp(x — y)), x, y e R 1 '; 

(d) </?( 0) 2 \ip(x + y)tp(0) - <p(x)ip(y)\ 2 ^ (<p(0) 2 - \v(x)\ 2 ) (<p( 0) 2 - ]<p(y )| 2 ). 


Proof. Fix x and y in M 1 ' and consider the matrices 


MO) ¥>(-a;)\ 

M®) </?(0) ) 


en 


/ p(0) <p{x) <p(y) \ 

(p(x) <p( 0 ) <p(x - y) . 

\t(v) <p(x - y) </?(0) / 


(a) and (b) Since the first one of these two matrices is positive-hermitian it 
follows that: 

<p(—x) = Lp{x) en \t( x )\ < <^( 0 ). 

(c) Since the second matrix is positive-hermitian, we obtain by the choice of the 
constants ai, a 2 and 03 : 


flq = 1, Q-2 


A|y(x) - ip(y )| 
ip{x) - (p{y) 


O3 = —02 


the following inequality for all A e M: 


<p( 0 ) (l + 2 A 2 ) + 2 A \<p(x) — (p(y)\ — 2 A 2 Re <p(x — y) ^ 0 . (5.9) 


The inequality in (c) is a consequence of (5.9). 


(d) The determinant of a positive hermitian matrix is non-negative. So that, if 
the 3x3 matrix 

A A 

AIM (5.10) 

i 1 / 

is positive-hermitian, then we get the inequality 

1 + A/i£ + A^ | A | 2 + |/i| 2 + |£| 2 , 
which is equivalent with 

|e-V| 2 ^ (1-|A| 2 ) (l-|/i| 2 ). (5.11) 

The inequality in (d) then follows from (5.11) by associating the second matrix 
with the matrix in (5.10) and by employing (5.11). 


The proof of Theorem 5.15 is complete now. 


□ 


5.16. Proposition. Let g be a function in L 1 (ML). Then the following equalities 
hold: 

spectral radius of (g) = lim ||g* n ||d = Halloo • 


In the theory of Banach algebras the Beurling-Gelfand formula gives a relation¬ 
ship between the spectral radius and the norm of an element. More precisely, 
let (A, ||-||) be a Banach algebra with unit e. A Banach algebra is a Banach 
space with a multiplication (x,y) >—>■ xy which satisfies the usual axioms of 
distributivity and scalar multiplication. The norm satisfies \\xy\\ < ||x|| • ||y||, 
x, y e A, ||e|| = 1. By definition, the spectrum cr(x) of an element x e A is 
given by a(x) = {A e C : Ae — x f G(A)}. Here G(A) is the group of invertible 
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elements of A: x e G(A) if and only if there exists a (unique) element y e A 
such that xy = yx = e. Then a(x) is a non-empty compact subset of C con¬ 
tained in the disc of radius ||x||: a(x) c {AeC: |A| ^ ||x||}. In fact we have the 
Beurling-Gelfand formula for the spectral radius: 

sup |A| = limsup ||x n || 1/n — inf ||x n || 1 ^ ra , x e A. (5-12) 

Xea{x) n—>00 neN" 


Let A = L 1 (R (/ )®C<5, where <5 is the Dirac measure at zero, with a multiplication 
given by the convolution product: 

(/ + aS) * (g + (35) = f * g + ag + a/3 , /, g e L 1 (BT), a, (3 e C, 

and with the norm given by ||/ + oA || i ll/ILi + |a|, / g L 1 (M^) ,qgC. Here 
/ * g[x ) — \ f{y)g(x — y ) dy. Then A is a commutative Banach algebra with 
unit 5. The spectral radius p(f ) of / e L 1 (BT) is given by the supremum norm 
of its Fourier transform: 


p(/)~ limsup |/“||p-inf ||/“||p 

n—XX) ne ^ 


sup 

xeR 1 ' 


f( x ) 




where f(x) = J e~ lx ' v f(y) dy. The interested reader can find more information 
in Bonsall and Duncan [ 22 ], in Yosida [ 154 ], and in several other places like 
Lax [ 81 ]. 


Proof of Proposition 5.16. For a proof we refer the reader to a book on 
functional analysis with Banach algebras as a topic. Good references are Rudin 
[ 117 ], Theorem 11.9 together with Example (e), and Folland [ 55 ], Theorem 1.30 
combined with Theorem 4.2. □ 
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The following theorem is a very important representation theorem. It will be 
used in Theorem 5.25 and in the continuity theorem of Levy: Theorem 5.42. 

5.17. Theorem (Bochner). Let p : E 1 " —> C be a function. The following 
assertions are equivalent: 

(i) The function p is continuous and positive-definite; 

(ii) There exists a positive Borel measure p op IT such that p = p. 

The Borel measure ft, in (ii) is unique. 


PROOF, (i) => (ii). Define the linear functional A : M —*• C by means of 
the equality: A(v) = ^p(x)du(x), v e M. Define the involution is >—> i) via the 
equality: is{A) = is(—A). Because, by hypothesis, the function p is positive- 
definite we see that the functional A is positive in the sense that K{is * is) A 0 
for all v e M: see inequality (5.26) in Proposition 5.23 further on. By Cauchy- 
Schwartz inequality we then obtain 

1 /Q 

|A(i/)| = | A(u * 5 )| < (A (is * v)) l/2 (A 
< (A (v * v)) 1 / 2 (p(O ) 1 ^ 2 


(by inductionwith respect to n) 


<(a((i s*u)* 2n ^Y <p(0) 

1/2"+ 1 


n +1 


SjA 12 ~ j 


< IM 


l/2 n+1 


V * V) 


\*2 V 


v(0) 


Eji‘2- 


By letting n tend to oo in (5.13) we deduce 


|A(z/)| ^ liminf 


v * v) 


\ *2 V 


1 / 2 ' 


n+1 


¥>(0) 


= yspectral radius of is * hp(0). 


(5.13) 


(5.14) 


By applying (5.13) and (5.14) to a measure v of the form is(B) 
where / belongs to L^IT) we obtain 


I 


p(x)f(x)dx 


< 


spectral radius of / * fp(0). 


Sb f (x)dx, 


(5.15) 


In (5.15) we wrote f(x) = f{—x) and / * g(x) = \f(y)g(x — y)dy, for / and g 
belonging to L 1 (EA). Next we realize that L l (PA), equipped with the L 1 -norm 
and the convolution product *, is a Banach-algebra and that the spectral radius 
of an L 1 -function / is given by the supremum-norm the Fourier transform of /: 
see Proposition 5.16. From (5.15) we infer 


< p(x)f(x)dx 

J 

/*/ 

^ ! 

3 

/A 

f 

cp(0). 

(5.16) 

J 

V 


00 


00 
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Next define Ao : j f : f e L^R") j —> C via the equality Ao(/) = §p(x)f(x)dx, 

f e L 1 (M U ). From (5.16) it follows that the functional A 0 has a unique exten¬ 
sion as a continuous linear functional, which we call again Ao, on the uniform 

closure of the subalgebra j / : / e L 1 (M l/ ) j . By the Stone Weierstrass theorem 

(Theorem 5.14) this closure coincides with C'o(lHT). The Riesz representation 
theorem applies to the effect that there exists a bounded Borel measure p such 
that A 0 (/) = $ f(x)dp{x), f g L 1 (M U ). From this it follows that 


J p{x)f{x)dx = Ao(/) = J f(y)dp(y ) = J p(x)f(x) dx. 


Consequently, p = p. The function p being positive-definite it follows that the 
measure p is positive. This proves the implication (i) => (ii). 


(ii) => (i). Let p be a finite positive Borel measure. Then its Fourier trans¬ 
form p is a uniformly continuous positive-definite function. The proof of these 
assertions is left to the reader. 

The proof of Theorem 5.17 is complete now. □ 


An alternative proof runs as follows: the idea is taken from Theorem 5.10 in 
Lorinczi et al [ 88 ]. We need the following lemmas. 

5.18. Lemma. Let p : l 1 ' -» C 5e a (uniformly) continuous positive-definite 
function, and fix t > 0. Then the function £ i—> e — 2 *1^1 <£>(£) is also (uniformly) 
continuous and positive-definite. 


PROOF. Let (j, 1 A j A n, belong to R", and let Ay, 1 < j < n, be complex 
numbers. Then 


2 X j X ke ^ 4l V(0-Cfc) 

j,k= 1 

1 r n _ 

= 2 \ je ^- y \ k e^yp & - &) e-\ y \ 2 'W dy > 0. (5.17) 

(V 2 ? Tt) M" jtk=1 


The claim in Lemma 5.18 follows from (5.17). □ 

5.19. Lemma. Let if : R" —> C be a function which belongs to L 1 (R"), and let 
V± be a bounded open neighborhood of the origin in R". Put V n = nV\, n e N. 
Let m (14) = S^v n (fi) df = n v m{\ 1 ) be the Lebesgue measure of V n . Then, 
uniformly in x e ML, 


f e^if{f)di 


lim 

n —>00 


v) ' x ^ (t - v) dfi dy 
m(V n ) 


(5.18) 
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Proof. By employing standard properties, like translation invariance and 
the homothety property of the Lebesgue measure, we deduce the following equal¬ 
ities: 


= f 

JR" 

k k\(v n -? 7 ) e ^ ^ (0 dpdrj k k\(nVi-n?7) e ^ ^ (0 dp dr] 


k k e^-^xp (p - v ) dp dr] 
m(V n ) 

kk-/'^ (0 dr > 


m(V n ) 


m(V n 


m(V) 


(5.19) 


From (5.19) we infer 


I 

JR' 




k k V) ' X *P ^ - d) dp dr] 


m(V n ) 


< 


^Vi ^R"\(nVi—nr]) 1^(01 dp dr] 


m(V) 


(5.20) 


Hence, by using the Lebesgue’s dominated convergence theorem the equality in 
(5.18) is readily established. Moreover, this limit is uniform in x e R". This 
completes the proof of Lemma 5.19. □ 


5.20. Lemma. Let rp : R 1 ' —> C be a continuous positive-definite function which 
belongs to L 1 (R"). Then, for all iel 1 ' the inequality e l ^ x xp(p) dp ^ 0 holds. 
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PROOF. Since the function ip is positive-definite and continuous the right- 
hand side of (5.18) is non-negative. So the assertion in Lemma 5.20 follows from 
Lemma 5.19. □ 

5.21. Lemma. Let ip : M" —» C be a continuous positive-definite function which 
belongs to L 1 (IT), and let p be a bounded complex-valued Borel measure on R" 
with Fourier transform jl(x) = e~ lx ' y dpfy). The the following equality holds: 

f VKO MO = 7T~F f f M x ip{C)df > p(x)dx. (5.21) 

JR" [ Z7r J JR" JR 1 ' 

If Lp : IT —► C is an arbitrary continuous positive-definite function, and if p is 
a bounded complex-valued Borel measure on IT, then 


f F(0 df-i(fi) = lim 7 v~tt 7 f f e^e 

JR" TO (, 27 TJ JR" JR" 


and 


J 


<p(£) MO 


< <p( 0 ) sup | p(x) 

xeR v 


(5.22) 

(5.23) 


Proof. From Fubini’s theorem we get 

77Ty 7 f f e^ x 1p(^)d^p(x)dx 

\^) JR" JR" 

= TTv 7 f f f e~ lx ' y dp{y) dx 

\ Z7T ) 

= f f ( 7 f v f e**i,(Oti)e- tay dxi,i(y) 

JR" JR" Vv^ 71 "; JR" J 

= f (ip) {y) dp(y) = [ ip(y)dp(y), 


(5.24) 


where B denotes the Fourier transform with inverse B 1 . The equalities in 
(5.24) imply the equality in (5.21). In order to prove he equality in (5.22) we first 
observe that by Lemma 5.18 the functions of the form £ h-► </? t (£) := e~ 5 *l£l 
t > 0 , are positive-definite and continuous, because (p is so. Applying the 
equality in ( 5 . 21 ) to the function <p t shows 


f <F (0 f e^viOMO 

JR*' 


= li mprv 7 f f \(fi)d^p(x)dx. (5.25) 

do (2vrJ J RI / J K i/ 

The equality in (5.22) follows from (5.25). Finally, the inequality in (5.23) 
follows from (5.22) and Lemma 5.20. So the proof of Lemma 5.21 is complete 
now. □ 

Second proof of Theorem 5.17. Let M = M(1T) be the collection of 
bounded complex Borel measures on Rl n “, and consider the functional 


A v : p j <p(0 dpff), pe M. 
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Then can be extended to the uniform closure of the collection [jl : /i e M} 
such that |A V (/)| ^ </?( 0 ) jfW^ for all / in this closure. This closure contains 
all constant functions and all continuous functions on M !/ which tend to 0 at oo. 
By the Riesz representation theorem there exists a positive measure on the 
Borel field of M" such that 

</?(£) d/i(£) = jj,(x) dp v (x) = e~ l ^' x d/i(£) dp^fx) 

Jm*' 

= f f e~* x dfi^x) dn(g) = ( p^)dp( 0 , 

JR" JR 1 ' JR 1 ' 

for all p e M. It follows that p(£) = p v (£). This completes the proof of the 
theorem of Bochner: Theorem 5.17. □ 

5.22. Lemma. Let p : M" —* C be a continuous function, and let p be a complex 
Borel measure on M" with compact support. So \p\ (W\K) = 0 for some compact 
subset K of ML. Then 

n 

p(x - y)dp(x)dp(y ) - afd k p (xj - x k ) 

j,k=l 

where the infimum is taken over all aj e C, Xj e K 0) 1 ^ j ^ n, n e N, and 
where K 0 is the smallest compact set K with the property that \p\ (M?\K) = 0. 




Proof. Fix e > 0, and choose a partition ( Uj : 1 < j < n) of K 0 with the 
property that 

\ip(x-y)-ip (x 1 -y ')| < ———j, 

W (K 0 ) 

x , x' 6 Uj and y, if e U k , and write aj = n(Uj). Then for x 3 e U r; 1 A j A n, 
we have 

<F(r - y)dp(x)dp(y) - J] afd k p (xj - x k ) 

JJ j,k =i 

Y, (<p(x -y)-<P (Xj - Xk)) dp(x)dp(y) 

i,k=i M Ju k 

2 \<p(x-y)-p(x j -x k )\d\p\(x)d\p\(y) 

j,k= 1 ^ U 0 J U k 


< 


H (Ko 


■ V d\p\(x)d\p\(y) 

j: k =i Ju k 


e. 


This proves Lemma 5 . 22 . 


□ 


5.23. Proposition, (a) Let p : IT —> C be a continuous function. The follow¬ 
ing assertions are equivalent. 


(i) The function p is positive-definite; 

(ii) For every function f e C'oo(E 1 ') the inequality §p(x)f * f(x)dx 5= 0 
holds; 
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(iii) Every Borel measure p with compact support satisfies the inequality: 

ip(x)d (p * p) (x) ^ 0. (5.26) 

(b) If ip is positive-definite and if p is a bounded complex Borel measure on K", 
then inequality (5.26) in (iii) also holds. 

Proof, (a) (iii) =^> (ii). Choose p of the form pfB) = § B f(x)dx, with 
/ e C' 00 (M i/ ) fixed. 

(ii) =^> (i). Let p be of the form p = Yj)=i a ]^x, ■ Approximate the de Dirac 
measures 8 Xj by measures of the form B h-> J fj N {x)dx in the sense that 

r> n 

ip{x)d (p N * pfjfi) (x) = ip{x)d (p * p) (x) = ajd k p (xj - x k ). 

J j,k =i 

Here the measure p^ is dehned by Pn{B ) = YTj =i a j Ib f:i.N( x )dx, B e !B(M"). 


lim 

JV->oo J 
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(i) => (iii). Let p be a Borel measure of compact support. Then there ex¬ 
ists a sequence of measures (/pv : N e N),where every p^ is of the form pn = 
Y!j=\ a j,N8x j>N and where 


p>(x)d (/i * ji) (x) 


lim 

TV—>oo 



Pn) (x) 


N 

lim V a,j N^k,NP> ( x j,n — x k,N ) ^ 0. 

iV —>00 1 

j,k=l 


That such a sequence of measures exists (jin : N e N) follows from Lemma 5 . 22 . 


(b) Let (K m : m e N) be an increasing sequence of compact subsets of R" such 
that R" = (J“=i K m and such that K m a interior (K m+ 1 ) for all meN. Since, 
in addition, 

<p( x )d (/i * jl) (x) = Jlim^ [y(x)d (il Km p) * ((lx m h))) 0*0 
assertion (b) follows from the results in (a). 

This completes the proof of Proposition 5.23. □ 

5.24. Definition. The weak topology (or Bernoulli topology) on M is the lo¬ 
cally convex topology a (M, Cb(Mk)). Let po £ M. So that every a (M, C'b(R 1 '))- 
neighborhood of po contains a neighborhood of the form 

n r 

P| j p e M : 

Here, the functions /],..., f n are bounded and continuous, and the numbers 
ey,... ,e n are strictly positive. A net (p a : a £ A) M converges to the measure 
p for the topology a (M, C'^R 1 ')) if lim Q $ fdp a = $ fdp for all / g C^R"). 


1 


fjd (p - p 0 ) 


< e, 


(5.27) 


We write p = weak- lim Q p a . The space M can also be supplied with the vague 
topology. This is the locally convex topology a (M, C'oo(R !/ ))- For the vague 
topology the functions fi, ■ ■ ■ , f n in (5.27) are required to belong to G'oo(R") 
and the net ( p a : a £ A) converges to p e M provided lim a $ fdp a = fdp for 
all / e C'oo(R iy ). We write p = vague- lim a p a . 

Let M + := {p £ M : p ^ 0} and let 

CP := CP(M. U ) = [ip g Cb(R") : <p positive-definite}. 

The following theorem expresses the fact that the set M + , endowed with the 
weak topology and CP, endowed with the compact-open topology T, are home- 
omorphic. The compact-open topology is also called the topology of uniform 
convergence on compact subsets of M'L So that a net (<p a : a £ A) converges to 
(p, if liniQ, sup xeK \<p a (x) — tp(x )| = 0 for every compact subset K of R". 

5.25. Theorem. The Fourier transform p > ji, p e M + , is a homeomorphism 
from 

(M + ,cx(M + ,a(K l/ ))) onto (CP,7). 
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PROOF. Let (p Q : a e A) be a net in M + that weakly converges to y e M + 
relative to the weak topology. We will prove that the net (y a : a e A) converges 
uniformly on compact subsets to y. Fix e > 0. Then choose 5 > 0 in such a 
way that 5 (3 + //(R")) < e and choose a function f e C'oo(R 1 ') such that 

0 < / < 1 and f (1 — f)dy < S. 

Since weak- lim y a = y there exists o;o e A such that 

y a (M. u ) = J 1 dy a < J ldy + 1 = /r(R") + 1 en f (1 — /) dy a < 5 

for all a ^ a 0 . Define the zero-neighborhood V by 

V = {x e R" : |1 — exp (—i (x,y))\ < 5 : for all y e supp(/)} . 


Then for those a e A and those x\ and X 2 e R" which satisfy a ^ cro and 
xi — X 2 £ V the following inequalities hold: 


\y a (xi) - y a (x 2 ) | < J |exp (—i {x u y)) - exp (-* (x 2 ,y))\dy a (y) 

< J I 1 - exp ( Xl ~ x2 ’ ^))l f(y) d ^(y) 

+ J |1 - exp (-* (xi - x 2 , y ))| (1 - f(y))dy a (y) 

< d J f(y) d Va(y) + 2 J(1 - f(y))dy a (y) 

< 5 (y(M. u ) + 1) + 2h < e. (5.28) 


By (5.28) it follows that \y(xi) — y(x 2 )\ < e for X\ and x 2 e R" for which 
x 1 — x 2 e V. Next choose a compact subset K in RF Then there exist y±,, y n 
in IRF such that K Q u;=i (%■ + V) and thee exist ctj e A, 1 ^ j ^ n, such that 

\V>a{Vj) ~ HVj )I < e for a ^ ctj, j = 1,..., n. 

Then choose a' e A in such a way that a' ^ a 3 for j = 1 ...., n. For x e y 3 + V 
and a ^ a' we get 

\fia{x) ~ y(x)\ \y a {x) - fiaiVj )I + I MVj) ~ U{Vj )I + I KVj) ~ fii x )\ < e 
and hence 

sup \y a (x) — y(x)\ < 3e. 

xeK 

This proves that the Fourier transform is continuous for the indicated topologies. 
Conversely, suppose that the net (y a : a e A) converges uniformly on compact 
subsets to y. Then we will show the following two equalities: 


(a) lim y a (M. u ) = y(W); 

(b) lim ^(p(x)dy a (x) = §(p(x)dy(x) for all functions (p e C^R"). 

From Theorem 5.26 below it then follows that weak- lim [i a = [i. The equality 
in (a) follows from: 

lim [i a (M 1 ') = lim p, a ( 0 ) = p,( 0 ) = [i (K 1 '). 
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Let e > 0 be arbitrary and let ip e Coo(®T)- Choose a function / e C'oo(IKT) with 
the property that 


<P~f 


< 


oo 2 fj, (R") + 1' 


Then we infer 


J (p(x)d/j, a (x) — J tp(x)dn(x ) 

J (<p{x) - f(x)^j dfj, a (x) + J f(x)d (n a - /i) (x) 


< 


< 


< 


+ 


f(x) - (p(x)) d//,(x) 


2p(R .") + 1 

The inequality 


(/h* (R") + fJ,(R v )) + J | %{x) - fi(x) \\f(x)\dx 
+ sup \fj, a (x) — fi,(x)\ \f(x)\dx. 

a;esupp(/) J 


<P~f 


e(/i a (W) + fi(W)) 


(5.29) 


lim sup 


J (p(x)d (fi a - n) (x) 


< e. 


follows from (5.29). As a consequence we see that (b) is proved now. Together 
with Theorem 5.26 which follows next this completes the proof of Theorem 
5.25. □ 
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5.26. Theorem. A net (ji a : a e A) in M + converges weakly to p e M + if and 
only if the net (ji Q : a e A) converges vaguely to p and if 

lim p a (M") = [i (M"). (5.30) 


PROOF. The weak topology is stronger than the vague topology and from 
weak convergence the equality in (5.30) also follows. Hence, the indicated con¬ 
ditions are necessary. Conversely, let a net (p a \ a e A) converge vaguely M + 
to /i and assume that (5.30) is satisfied. We will prove that /a is the weak limit 
of the net (p a : a e A). Therefore pick / e Ci } (W') and e > 0 arbitrary but 
fixed. Choose a compact subset K such that p (R"\W) < e. In addition, choose 
a function h e Coo(lHC) in such a way that Ik < h ^ 1. By these hypotheses 
the following (in-)equalities hold: 


lim J(1 — h)dp a = J*(l — h)dp < /i i 


*\K) < * 


and also 


lim 


J fhdn a = J 


fhdji. 


fhdn a = 

Hence, there exists an a 0 e A such that (for a ^ a 0 ) 

J fhd (//,« - n) 


^(1 — h)d/j, a < e 
But then for a ^ ao we get 
fd (fj, a - n) 

< J fhd (/j, a - 

^ e (1 + 2 ||/|| c 


< e. 


I 


- v) 


+ 


J/(I - h)d/j, + f/(l - h)d/j, 0 


which shows that lim ^ fdjj a = J fd/i. 

This completes the proof of Theorem 5.26. 

5.27. Corollary. The following assertions are true: 


□ 


(a) The set CP is a convex cone, which is closed for the topology of uniform 
convergence on compact subsets. 

(b) With p the functions Tp and Re ip also belong to CP. 

(c) If pi and p 2 belong to CP, then the same is true for the product piP 2 - 

(d) For every y e M" the function x > exp (—i (x, y)) belongs to CP. 
Convex combinations of such functions belong to CP. 


Proof. The proof is left as an exercise for the reader. 


□ 
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5.28. Definition. A function i/; : —> C is called negative-definite if for all 

n e N and for all complex numbers ai, ..., a n and for all vectors ..., x ^ 
in M 1 ' the inequality 

n 

y djCLk (fp (x^) + ip (x( k l) — ip (x^ — x ^) ^ ^ 0 (5.31) 

j,k=l 

holds. The symbol CN denotes the collection of all continuous negative-definite 
functions on M'h If ip belongs to CN, then the same is true for ip andRe ip. The 
collection CN is a convex cone. If ip belongs to CN, then ^(0) > 0 and 
ip(x) = ip(—x) for all x e M l/ . A function ip is negative-definite if and only if ip 
has the following properties: 

(1) ^(0) >0; 

(2) For every lei" the equality ip(x) = ip(—x ) holds; 

(3) For every n e N and for every n-tuple of complex numbers oq,..., a n , for 
which Xj=i a j = 0, and for all vectors x^ ,..., x ^ in M 1 ' the following 
inequality holds: 

n 

y ajUkip (x ^—^ o. 

j,k=l 

If the function ip is negative-definite, then so is the function ip — ip ( 0). If cp is 
positive-definite, then the function <p(0) — ip is negative-definite. 

The following theorem establishes an important connection between negative- 
and positive-definite functions. 

5.29. Theorem (Schoenberg). A function ip belongs to CN if and only the 
following two conditions are satisfied: 

(i) ^(0) ^ 0; 

(ii) For every t > 0 the function exp (—tip) is continuous and positive- 
definite. 

Let ip be a negative-definite function. Then, by Bochner’s theorem together 
with the theorem of Schoenberg, there exists for every t > 0 a sub-probability 
measure /i t on the Borel field of M !/ such that fit = exp (—tip). We return to this 
aspect when we discuss the notion convolution semigroup of measures. 

PROOF. First suppose that ip belongs to CN. Let x^,..., x^ belong to M". 
Write dj : k = ip (x^+ip (x^^—ip ( xV — x^y Then the matrix with entries a jk 
is positive hermitian. But then the matrix with entries exp (a^j.) is also positive 
hermitian. Let ai,... ,a n belong to C and write a) = exp (—ip (aV)) aj. Then 
we see 

n n 

y exp (-ip (x (j) - x (fc) )) ajd k = y exp(a jtk )a , j a' k ^ 0. (5.32) 

j,k= 1 j,k= 1 
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From (5.32) it follows that the function exp (— ip) is then positive-definite. The 
same procedure can be repeated for the function tip. Conversely, if (i) and 
(ii) are satisfied, then, for every t > 0, the function ip t := 1 — exp (—tv) = 
1 — exp (—tip(0)) + exp (—tip(0)) — exp {—tip) is negative-definite. But then the 

function ip is negative-definite as well, because ip = lim —. Since 

tio t 

1 - exp(-tip(x)) 

w(x) = -7-, 

$ 0 ds exp (— s'ip(x)) 

for t > 0 but small enough, we see that the function ip is continuous at x. 

So the proof of Theorem 5.29 is now complete. □ 

5.30. Definition. A family of Borel measures (/j, t : t ^ 0) with the following 
properties: 

(a) (IKC) < 1 for t > 0; 

(b) ji s * f i, t = Hs+t f° r a U s and t ^ 0; 

(c) lim 40 $ fdfH = $ fdfj , 0 = /(0) = S 0 (f) for all / e Coo (R*'); 

is called a (vaguely continuous) convolution semigroup of measures on W'. 
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The following theorem says that a vaguely continuous convolution semigroups 
is in fact everywhere weakly continuous. 

5.31. Theorem. There exists a one-to-one correspondence between vaguely con¬ 
tinuous semigroups of measures and negative-definite functions. 

(a) If (fi t : t 5= 0) is a vaguely continuous convolution semigroup of mea¬ 
sures, then there exists a unique continuous negative-definite function 

such that fi t = exp (—tip), for all t ^ 0. 

(b) Conversely, if ip is a negative-definite function, then there exists a 
vaguely continuous convolution semigroup of measures (ji t : t ^ 0) such 
that fit = exp (—tip) for all t ^ 0. Of course, this semigroup is unique. 


Proof, (a) Define, for t > 0, the function ip via the equality 


ip = 




\ 

Jo 


(5.33) 


f s ds 


Since fi s fit = p»+t we see that ip does not depend on the choice of t. Put 
g(t) = ^ 0 fi s ds. Then we see that g( 0) = 0 and g(t)ip + g'(t ) = 1, and hence 
1 — exp (—tip) 


g(t) = 


ip 


From the latter it follows that fi t = exp (—tip). The 


Theorem of Schoenberg (Theorem 5.29) implies then that the function ip is 
negative-definite. The functions fi s , s ^ 0, are continuous. So the same is true 
for ip. 


(b) Since ip is a negative-definite function, the functions exp (—tip) are positive- 
definite by the theorem of Schoenberg. The theorem of Bochner (Theorem 
5.17) yields the existence of sub-probability measures (p t : t ^ 0) such that f t = 
exp (—tip). Since 

lim/q(0 = lim exp (—tip(l;)) = 1 = p 0 (£) 

Theorem 5.43 in the next section implies that lim^o S fdfit = f( 0) for functions 
feCoo(W). 

The proof of Theorem 5.31 is now complete. □ 


5.32. Remark. In the proof of Theorem 5.31 part (a) there is a problem if 

t i 

the integral fi s ds vanishes somewhere. However, notice that lim - I fi s ds = 

/do pointwise. It follows that, certainly, for t = t(p) > 0 small enough, the 
expression ^fis^ds 0. This fact can be used to circumvent this problem. 


5.33. Remark. In the proof of Theorem 5.31 part (b) Theorem 5.43 of the 
next section was employed. This can be averted as well. Therefore con¬ 
sider /, with / 6 ZdQR"). Then lim ti0 $ f(x)dp t (x) = lim ti0 $ f(x)fi t (x)dx = 
\f(x)dx = f( 0) = \ fdfiQ. By the theorem of Stone-Weierstrass from this we 
obtain lim f(x)dfi t (x) = f( 0) = $ f(x)dp 0 (x). 
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5.34. Proposition. Let (ji, t : t ^ 0) be a vaguely continuous semigroup of Borel 
measures on MT Suppose that all these measures are probability measures. Then 
the following assertions hold: 

(a) weak- lim^ tO;t>0 g t = /P„ for all t 0 ^ 0; 

(b) lim t ^ to sup x6R , | J/(x - y)dy t (y) -\f{x- y)dy to (y)\ = 0 for all t 0 e 
[0, go) and for all functions f e Co(PT). 

Proof, (a) First we look at 

Ik (IT) - p, to (IT) = exp (-t^(O)) - exp 0)). 

It follows that 

lim y t (R") - pL to (M 1 ') = 0. 

t^t 0 

For the same reason we see that 

lim //*(£) = lim exp = exp (~t 0 fi(f)) = MO- 

t^t o t—>to 

By using theorem 5.43 in the next section we see that 

weak- lim u t = u tn . 
t^>t 0 ,t>o 

Of course, in this proof the function if denotes the negative-definite function 
from Theorem 5.31. 

(b) Let g e Cq (IT) be of the form g = f with f e L 1 (W / ). Then we see 

f(x - y)dg t {y) ~ J fix ~ y)dy to (y) 

(M~ z ) ~ M~ z )) ex P (-* (x, z)) f(z)dz 
^ J | exp (-tfi(-z)) - exp (-t 0 fi(-z))\ \f(z)\dz 

< J |exp (- 1 1 - f 0 | 'if(-z)) - 1| \f(z)\dz. (5.34) 

The assertion in (b) now follows from (5.34) together with the theorem of Stone- 
Weierstrass, and completes the proof of Proposition 5.34. □ 

5.35. Proposition. Let (ji t : t ^ 0) be a vaguely continuous semigroup of prob¬ 
ability measures on the Borel field of ML. Define for every n-tuple ti,... ,t n with 
0 ^ t\ < • ■ ■ < t n , the probability measure P on the Borel field of ( R") n via 
de formula 

= Ht! ® fki-t! ® • 1 ((®i, •• • ,Xn) 6 (R")" :V n (xi,...,x n )e B) 

dn tl (xi )... dptn-tn-^Xn^B iV n (xi, .. • ,x n )) , (5.35) 
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where B is a Borel subset of (R I ') n and where V n : (R I ') n —* (R 1 ')™ is the linear 
mapping given by: V n : (xi,X 2 , ■ ■ ■, x n ) >—> (aq, x\ + x?, ■ ■ ., x\ + • • • + x n ). Then 
the family 

{((R") n , ® (R") n , ¥ tl _ tn ) : (B, ..., t n ) e [0, oo)", n e N} 
forms a projective system of probability measures. 


Proof. Let B e £ (R") n and let B' e T> ((E l/ ) n+1 ) be defined by 

B' = {(zi,.. .,z n+ 1 ) e (R") n+1 : (zi,... ,z k ,z k+2 ,.. .,z n+ 1 ) 6 B) . 

Let ti < ■ ■ ■ < t k < s < tk+i < ■ ■ ■ < t n be an (n + l)-tuple of increasing times. 
We have to prove the following equality: 

Since the vector 


V „+1 (yi,s/ 2 , • • •, y n +i) := (yi, yi + j/ 2 , • • •, yi + ■ • • + y n +i) 

belongs to B' if and only if the vector 

(yi5 Vl + 2/2) • • • , Vl + • • • + Vk-, Vl + • • • + Vk+ 2 ) • • • ) 2/1 + ' ' ' + 2/n+l) 
belongs to B, we get what follows: 


E 


^1 ? • • • )t k 5^5^A: + 1 j • • • 


(B') 


l^ti ® * * * ® ^t k —t k _i ® l^s-tk ® !^t k+1 -s ® ' ' ' 

® l^tn-tn-l { (Z/l 5 • ‘ * 5 2/n+l) ! ^Ti+1 (Z/l ? * * * 5 2/n+l) ^ B } 

= • • • J" dt*>t k —t k _i (jjk) ^ dfA s -t k (y) J" dfit k+1 - s [z) ^ d/j J t k+2 ~t k+1 (z^ 2 )... 

J" dnt n —t n -1 (^n+l (?/l5 • • • lUkiUi z k+ 2? • • • ? ^n)) 

= Jd^i(yi) • • • f d ktk -t k _AVk) J dfj, a - tk (y) f dtM k+1 - a (z) J d ktk+2 -t k+1 (z k+2 )... 

^ d k t„-t n -i (firi (2/l 1 • ■ ■ lUkiU T Z, Zfc+2> • • • i ^n)) 


(apply Fubini’s theorem, integrate relative to (j, s - tk ®(M k+1 -s and use the equality 
lg(y + z)d ku (y)dn v {z) = lg(z k+ i)d ku+v {z k+ i)) 


= J d ktl (yi )... J d[A tk -t k _ 1 (yk) J diH k+1 -t k (zk+i) J d ktk+2 - tk+1 (z k+2 ) 

. . . ^ dfltn-tn-liZn) 

1 B (Vn (^/lj • • • 1 Vk) %k+ 1 ? • • • 5 ^n)) 

- J dlkiivi) ••• J s /1 + •••+&.) 


-Ph.fc.(S). 
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This proves the required equality in case 1 ^ k < n — 2. The other cases, which 
are t n _2 < s < t n - 1 , t n - 1 < s < t n , t n < s and T > s, are left as an exercise for 
the reader. 

So the proof of Proposition 5.35 is complete now. □ 

5.36. Proposition. Let (ji t : t ^ 0) be a vaguely continuous semigroup of prob¬ 
ability measures on the Borel field of ML. Define, for every n-tuple ti,... ,t n the 
probability measure where t\ < • • • < t n , as in Proposition 5.35. Then 

there exists a unique probability measure P on the product field of (PT)^’ 00 -* such 
that 

P((X*(«„)) e B) = P tl . ,JB), 

for all Borel subsets B of (R 1 ')"'. Likewise there exists, for every x e R", a 
unique probability measure P x on the product field of (PT)!- 0 ’ 00 -* such that 

P* ((Xih),..., X(t n )) eB)= P ((x + X(h), ...,x + X(t n )) e B) 

= J dp tl (x 1 ) 0 • • • 0d/i* n _t n _ 1 (a: n )l s (x + x u ..., x + x 1 + ... + x n ), 
for all Borel subsets B of (R") n . 

Here the state variable X(t) : (R l/ )^ 0,oo ' ) —> R 1 ' is defined by X(t)(cu) = uj(t), 
where u belongs to the product (R 17 )'- 0 ’ 00 ^. 
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Proof of Proposition 5.36. Apply Kolmogorov’s extension theorem. □ 

5.37. Theorem. Let (/i t ■ t A 0) be a vaguely continuous semigroup of prob¬ 
ability measures on the Borel field of ML. Define for every n-tuple ti,...,t n 
the probability measure P tl! ... )tn; where t\ <■■■< t n , as in Proposition 5.35 
and let M x , x e PA, be the unique probability measure on the product field of 
Cl = (M I ')[°’ G0 - ) such that 

= c?/ij 1 (x\) • • • dutn-tn -1 ( X n)^-B (x + x li • • • ) X + X\ + ... + x n ). (5.36) 

Let 37, be the a-field on Cl generated by X{u), 0 < w < s. For t > s the 
variable X{t) — X ( s ) is independent of 37, and X{t) — X (s) possesses the same 
M x -distribution as X(t — s) — x, which is gt-s- 


Proof. Fix t > s, let / : (]R I ') n —> M be a Borel measurable function, and 
suppose that 0 ^ si < • • • < s n = s. Let g : M 1 ' —> M be another bounded Borel 
measurable function. Then the following equalities hold true: 

E (/ (V( Sl ),.... X(sJ)g (X(t) - X(s))) 

- J dgnfxi) ■ ■ ■ J ) \ 'Ifii-JTjf (xi ,..+ x„)g(x) 

- E (/ (X(s ,) y ..., X(s„))) E (g (X(t) - X(s))). 

Now let H be the vector space of dy-measurable bounded random variables Y 
with the property that E (Yg(X(t) — X(s))) = E (K) E (g(X(t) — X(s))). Then 
H satisfies the hypotheses of Lemma 5.100. Whence, H contains all bounded 37,- 
measurable random variables. Since, in addition, the function g is an arbitrary 
bounded continuous function, it follows that the state variable X(t) — X(s) is 
independent of 37*. This completes the proof of Theorem 5.37. □ 

5.38. Theorem. Let (r2,3 r , P) be a probability space and let (X(t) : t ^ 0) be a 
family of state variables with state space PA. Assume that these state variables 
are measurable relative to the a-fields 3 r and 23 (M"). Suppose that 

limE [f(X(t))] = /(0) for all f e Coo (R"), 

and also that for every t > s the variable X(t) — X(s) is independent of the a- 
field a (X(u) : 0 < u ^ s) and that X(t) — X(s) possesses the same distribution 
as X(t — s ). Then the mapping B >—> pit(B) : = P(X(f) e B) defines a vaguely 
continuous semigroup of probability measures on RA 


Proof. It is clear that every measure is a probability measure is on 
the Borel cr-field of RA Since J fdp, t = E(/(X(f))), for / e Coo (R"), the 
equality limqoE(/(X(f))) = /(0), entails that the family (/q : t Y t) is vaguely 
continuous at 0. The convolution property still has to be proved. It suffices to 
prove that g s {fi)fi t {fi) = fi s+t {fi) for all s and t ^ 0, and for all (e!7 To this 
end consider 

exp (-i{£,x))dfji a (x) exp (~i {£,y)) d/i t (y) 
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= E (exp (-i <£, X(s)))) E (exp (-* <£, X(t)))) 


(the variable X(t) has the same distribution as X(s + t) — X(s)) 

= E(exp(-z (f, X(s))))E(exp(-i (£,A(s + t) - X(s)))) 


(A(s + t) — X(s) does not depend on X(s)) 

= E (exp (-* <£, X(s) + X{s + t)~ X{s)))) 
= E(exp(-i (£,X(s + t)))) = p s+t (0- 


Since 0 = X(0) — X(0) it follows that /v, 0 has the distribution d 0 . This proves 


Theorem 5.38. 


□ 


5.39. Definition. Let (fl,T, P) be a probability space and let the mapping 
X : (■ t,ui ) i—> X(t,cu) = X(t)(oj) satisfy the hypotheses mentioned in Theorem 
5.38. (So that for t > s the state variable X(t) — X(s) does not depend on the 
u-field a (X(u) : 0 < u < s) and X(t) — X (s) possesses the same distribution 
as X(t — s); moreover, the equality linpqo E (f(X(s))) = /(0) holds for all 
/ e Co (PT)). Then the process X is called a Levy-process, that begins at 
A'(0) = 0. 

Important Levy-processes are the Poisson process with jumps 1 and the Brow¬ 
nian motion. The one-dimensional distributions of a Poisson process X (with 
jumps 1 and of intensity A) are given by 



For details on Poisson processes see Subsection 5.4 in Chapter 1. The Brownian 
motion B (with drift 0, intensity I and which starts in 0) possesses as one¬ 
dimensional distributions: 



For more details on Brownian motion see the Section 4 in Chapter 1 and Sec¬ 
tion 3 in Chapter 2. In addition, see Chapter 3. A Levy-process with initial 
distribution p is a family of T-T-measurable mappings X(t) : Q ^ M l/ such that 
A(0) has the distribution p, and such that the process t >—► X(t) — X(0) is a 
Levy-process that starts at 0. If the initial distribution p = 8 X , then it said that 
the process X starts at x. If X = (X(t) : t 5= 0) is a Levy-process that starts 
at 0, then (x + X(t) : t ^ 0) is a Levy-process, which starts at x. The Poisson 
process Xj (with jumps 1 and intensity A) which starts at j e hi possesses as 
marginal or one-dimensional distributions: 



Thus the distributions of the processes (. Xj(t ) : t ^ 0) and (j + X(t) : t ^ 0), 
where X is the Poisson-process which starts at 0, are the same. The Brownian 
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motion B x (with drift 0, intensity I and which starts at x) possesses the following 
one-dimensional distributions: 


P (B x (t) e B) 



dy. 


5.40. Definition. Let E be a locally compact Hausdorff space and let 

{P{t) : t > 0} 

be a family of linear operators of Cq{E) to the space L G0 (£', £). Here £ is the 
Borel field of E. This family is called a Feller semigroup, or Feller-Dynkin 
semigroup provided it possesses the following properties: 


(i) semigroup-property: P(s + t) = P(s)P(t) and P( 0) = /; 

(ii) positivity preserving: f ^ 0, / e Cq{E), implies P(t)f ^ 0; 

(iii) contractive: 0 < / < 1, / e Co(E), implies 0 < P(£)/ < 1; 

(iv) continuity: lim t m [P(t) f 1 (x) = fix) for all f e Cn(E) and for all x e E: 

(v) invariance: P(l)C a (E) J C,(E) for all t > 0. 

In the presence of (i), (v) and (iii) assertion (iv) is equivalent with 
(iv') lim^ to , t>0 || P(t)f - P(to)/|| 00 = 0 for all / e C 0 (E). 
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5.41. Theorem. Let (n,T, P) be a probability space, and ( X(t ) : t ^ 0) be a 
family of state variables with state space R". Suppose that these state variables 
are measurable relative to the a-fields T and 23 (R"). In addition, suppose that 
lim t j 0 E ( f(X(t ))) = /(0) for all f s Coo (R") c^d also that for every t > s the 
variable X(t) — X(s) does not depend on the a-field a (X(u) : 0 < u ^ s), an d 
this for all t > s ^ 0. Moreover, by hypothesis, the variable X(t) — X(s) has 
the same distribution as X(t — s). Define the operator P(t ) from L 00 (R") to 
itself by [P(t)f] (x) = E (/ (x + X(t))), f e L 00 (R 1 '). The restriction of P(t) to 

Co (E") leaves the space Co (R") invariant, and the family jp(£) t ^ 0 j 

is a Feller semigroup (also called a Feller-Dynkin semigroup). 


PROOF. It is clear that every operator P(t) is contractive and positivity 
preserving. It is also clear that linpjo [P{t)f] ( x ) = f( x ) for a U x e R^ and for 
all / e C'o(E^). We still have to prove the invariance property. Let f = g, 
where g belongs to L 1 (R"). Then we obtain 

[P(t)f] (x) = E (/ (x + X(t))) = E (g(x + X{t))) 


- J exp (-! ((, x)) E (exp (~i(£,X(t))))g(£)d£. 


(5.37) 


By the lemma of Riemann-Lebesgue (Theorem 5.12), the equalities in (5.37) 
imply the equality 

lim [P(t)f] (x ) = 0. 

X—>CO 

The continuity of the function P(t)f is clear as well. As a consequence, P(t) 
maps the space {g : g e L 1 (R^)} to Co (E y ). The theorem of Stone-Weierstrass 
implies that the space {g : g e L 1 (R")} is dense in Co (R") for the uniform 
topology. Because of the contractive character of the operator P(t) it then 
follows that P{t) leaves the space Co (PA) invariant. In order to finish we prove 
the semigroup-property. Again we take the Fourier transform g of a function 
g g L 1 (R") and we consider 

[P(s + t)g] (x) 

= E (g(x + X(s + t))) 

= f e^E (exp (-i <£, X(s + t)))) g(f)df 

= J e-^E (exp (—i (£,X(s + t) - X(s))) exp (-* (£, X(s)})) g(0 d £ 

(the variable X(s + t) — X(s) is independent of W(s)) 

= J e^E (exp H <£, X(s + t) - X{s)))) E (exp (-* <£, X(s)») g{f)df 
(the variable X(s + t) — X(s) has the same distribution as X(t)) 

= J e-‘<«E (exp (-t (f, X(t)))) E (exp (-i ((, 

= E (cu — E (a/ — g(x + X(s)(u) + X(t) (a;')))). (5.38) 
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The semigroup-property then follows from (5.38) together with the Theorem of 
Stone-Weierstrass which, among other things, implies that the space 

{g-.geL 1 (IT)} 

is dense in Co (M !/ ) for the uniform topology. 

This completes the proof of Theorem 5.41. □ 


2. Convergence of positive measures 

We begin with the continuity theorem of Levy. 

5.42. Theorem (Levy). Let (ji n : neN) be a sequence of bounded positive Borel 
measures on IT. Assume that there exists a function p : IT —> C, which is 
continuous at 0, such that 

lim jl n {x) = p(x) 

n—> oo 

for all reT. Then there exists a bounded positive Borel measure such that 

weak- lim ji n = ft. 

n—>oo 


Proof. The function p is a point-wise limit of positive-definite functions 
and so it is itself positive-definite as well. Since the function p is continuous 
at 0, inequality (c) in Theorem 5.15 implies the continuity of p. By Bochner’s 
theorem (Theorem 5.17) there exists a positive bounded Borel measure ft such 
that 

lim p n (x) = p(x) = fi(x) (5.39) 

n —>oo 

for all x e MC Next, let / be an arbitrary function in Coo (IRC). Then we see 


lim sup 

n —>oo 



= lim sup 

n—> oo 

C lim sup 

n—> oo 


II 

I 


f(x) (p n (x) ~ p,(x)) 
\f(x)\ | fi n (x) ~ fi(x)\ 


dx 

dx. 


(5.40) 


In view of (5.39) we see that the integrand in (5.40) converges pointwise to 0. 
Write c = sup neN (/t n (0) + jfi(O)). Then c is finite and \f(x)\ \p, n (x) — ju(x)| is 
dominated by the Id-function c\f(x)\. By the dominated convergence theorem 
(Lebesgue) it follows from (5.40) that 

lim sup 

n—► oo 

Since the subspace |/ : / e Coo (® l/ )| is uniformly dense in Co (IT) (see The¬ 
orem 5.14), from (5.41) it follows that lim n ^oo J pdp n = $ pdfi for all functions 
p e Coo (IRC). Theorem 5.26 then implies weak- lim r! ^ x fi n = ft,. This proves the 
continuity theorem of Levy: Theorem 5.42. □ 


f fdPn~ J 


fdp 


0. 


(5.41) 
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In the following theorem we compare several equivalent forms of weak conver¬ 
gence. If a = (ai,..., a v ) and b = (bi,... ,b v ) belong to R" and if Oj < bj , 
1 < j < v, then we write a < b and also (a, b] = (ai, b\] x • • • x (a„, b u \. 

5.43. Theorem. Let (p a : a e A) be a directed system (a net) in M + consisting 
of sub-probability measures (so that p a (R") < 1, a e A) and let p e M + be 
a sub-probability measure as well. Let (fk'.ke N) be a sequence in Co (R 1 ') 
with a linear span which is dense in Co (R"). The following assertions are then 
equivalent: 

(1) The net (p a : a e A) converges weakly to p; 

(2) For every bounded Borel measurable function f : R 1 ' —> C which is 
continuous in p-almost all points the equality lim a J fdp a = $ fdp holds; 

(3) The net (p a : a e A) converges vaguely to p and lim p a (R 1 ') = ft, (R 1 '); 

(4) For every closed subset F of R" the inequality limsup a p a (F) ^ p(F) 
holds and 

lim p a (R") = p (R"); 

a. 

(5) For every open subset G of W the inequality liminf a p a (G) ^ p(G) 
holds and 

lim p a (R") = p(R v ); 

a, 

(6) For every Borel subset B of ML, for which p ^B\B^j = 0, the equality 
lim a p a (B) = p(B) holds; 

(7) For every pair of points ( a,b ) e R" x R" such that a 3 < bj, 1 < 
j < v, where a = (ai,..., a v ), b = (&i,..., b v ), with the property 
that |i{iel": x 3 = a 3 } = p{x e ML : x 3 = bj} = 0, j = 1,..., u, the 
equality lim a p a (a, b\ = p(a , b\ holds and lim a p a (R") = p (R^). 

(8) For every k s N the equalities lim a J fkdp a = $ fkdp and lim Q p a (R") = 
p (R") hold; 

(9) For every xeT the equality lim a p a (x) = p{x) holds. 

(10) For every aeF for which p{x eW : Xj = aj} = 0, j = 1,..., v, the 
equality 

lim p a [(—oo, ai] x • • • x (—oo, a v f\ = p [(—go, ai] x • • • x (—go, a„]] 

a 

holds and lim a p a (R 1 ') = p (R"). 


Proof. The equivalence of the assertions (1) and (9) is a consequence of 
Theorem 5.25. The equivalence of (1) and (3) is a consequence of Theorem 5.26. 
The implication (1) => (8) is trivial. The implication (8) => (3) can be proved 
as follows. From (8) it follows that lim Q J pdp a = $ ipdp for all (p in the linear 
span of (f k : k e N) u {1}. So that for / 6 Co (R v ) + Cl and ip in the span of 
(fk : k e N) u {1} we see that 


lim sup 


I 


f d (p a — p) 
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< limsup 


+ lim sup 


J (/ - <P) d (jJa ~ /i) 

< ||/-^|loo limSU P(^a( IRly ) +V(W')) 

a 

= 21|/ - v>ll„ a* (R") - 


J ipd t/i,, - ft) 


(5.42) 


Assertion (3) follows because the linear span of (/& : k e N) u {1} is uniformly 
dense in Cq (M' y )+Cl. From the previous arguments it follows that the assertions 
(1), (3), (8) and (9) are equivalent. 


(2) => (1). This implication is trivial. 

(1) => (4). Let F be a closed subset of KF. Choose a sequence of functions 
(uj : j e N) in C'fe(M^) in such a way that lp < u j+i A Uj A 1, j £ N, and such 
that 1 f(x) = lirrq^a-j u 3 (x) for all x e IT. Then the equality 


lim sup Ha.{F) 

a 


< inf lim sup 

JS N Q 


Undu a = inf u 

J J ieNj • 


;dfj, 


>*(F) 


holds. This proves assertion (4) starting from (1). 


(4) (5). These implications are easy to verify. 
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(5) =^> (6). Let B be a Borel subset of E^ such that /i ( B\B ) = 0. Then, from 
(5), what is equivalent to (4), it follows that 


limsup Ha{B) ^ limsup // Q (T>) ^ /i (i?) = h\B 

a V / 

^ liminf ( B ) < liminf / i a (B) . (5.43) 

a \ J a 

Hence, lim a fj, a (B) = //(£>). 

(6) =^> (1). Let 0 < / < 1 be a continuous function. Because 

(fdfi= f fi {/>£}(%= f /i{/>£}d£ 

J Jo Jo 

we see that $o A 4 {/ = £} d£ = 0. Thus for almost all £ the equality // {/ = £} = 0 
follows. For a certain sequence (cq : £ e N) in A, we then obtain by (6) the 
following (in-)equalities 


f fd(i= f {f>£}d[i= f n{f>£}d£ 

J Jo Jo 

= f lim/i a {/ ^ £}d£ = i lim/i a {/ > £} d£ 
Jo “ Jo “ 


^ 4 2 {/ > fc2 ' n }+4 < 4 2 j™ ^ {/ > fc2_n )+4 


fc=i 


2 n 2 n ^ e^oo ' 
fc=i 


(Fatou’s lemma) 


< lim | — 

.£—►00 


on 

hS 

J 


l{f>k2-”-}dHa e + — 


< lim I fdu a , 4-= liminf f fdu a 4-. 

r->oo J J rat 2 n a J 17 ^ 2 n 

From (5.44) it then follows that, always for 0</<l, 


I 


fd/ji < liminf fd Hen 


J- 


and also 


^(1 — f)d/j, < lim inf J*(l — f)d/j, a - 


(5.44) 


(5.45) 


(5.46) 


Since, in addition, lim Q /i a (E 1 ') = /i (R") we see by (5.45) and (5.46) that 
lim a fd/j, a = $ fdfi for every function f e Cb (E") for which 0 < / < 1. Since 
the linear span of such functions coincides with Cb (E") assertion (1) follows 
from (6). (5) =^> (2). Let / be a real-valued bounded function which //-almost 
everywhere continuous. Without loss of generality we assume that 0 ^ / < 1 
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(otherwise replace / with af + 6, with a and b appropriately chosen constants). 
Then define the functions f n and f u respectively by 


f n (x)= inf sup / (y) en f J (x) = sup inf f(y). (5.47) 

UeU(x) yeU UeU(x) V eU 

It follows that $ (/ n — f' J ) d/i = 0 and also f' J ^ f ^ f n . Hence, for an 
appropriately chosen sequence (ry» : £ e N), 

2 n 

J fd P = | ran « 4 + i j _> ir > * 2 -"} 

2 n 

< jr + 2> *= 2 -”} 

fc = l 

2 n 

^ -- + liminf V g af {/ u > fc2 _n } 

9™ £—*co 2 n 1 J 


fc=l 


1 


f^dna < — + liminf | fdy 0 


1 f 

<-1- lim inf 

2 n « J 

From (5.48) it follows that 

fdjx F liminf I fdg a . 


J- 


I 


j- 


(5.48) 


For the same reason the inequality $(1 — f )d/i. < lim inf Q $(1 — f )d(i a holds. 
Because, in addition, lim Q ,^ a (M !/ ) = /i (E") we see that $/d/i = lim Q $ fd/j, a . 
This proves (2) starting from (5). 


(6) => (7). This assertion is trivial. 

(7) => (8). In this part of the proof we write (a, b] for the interval («i, 6i] x 
• • • x (a, y . b u ], if a and b are points in EF for which a,j < bj for 1 F j F u. From 
(7) it follows that lim a /r a (a,6] = /i(a, 5] for all points a and b in KF with the 
property that fi{y e IF : Xj = aj] = y{y e KF : Xj = bj} = 0 for all 1 ^ j ^ u. 
Next pick for / a function in Coo (E 1 ') with values in E. Let g be an arbitrary 
function of the form g = YTj =i /(T/OCcmbb w ^ ere a j and bj are points in E" with 
the following properties: / i{y eT: Vk = &j,k} = n{y eK": Vk = bj^} = 0, for 
1 < k ^ is, and < bj : k for j = 1,... ,n, and 1 ^ k < v. In addition, suppose 
that Xj belongs to the “interval” (a v 6 ? ]. The we get 

lim sup fdfi a < lim sup (/ — g)dfi a + lim sup gdfj, a 

a J a J a J 

< 11/ - piloo M (E 17 ) + f gdy < 2 11/ - gj^ /i (E 1 ') + J /d/i. 

Since / is uniformly continuous we are able to choose, for a given e > 0, a 
function g of the form as above in such a way that ||/ — r/|| x F e. This proves 
the inequality lim sup a $ fdg, a ^ $ fdg. The same argument can be applied to 
the function —/. It follows that lim Q J fdg a = $ fd,g for functions / e Coo (E") 
that are real valued. But then (8) follows. 
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(10) (7) Let a < b be as in (7). Then fi a (a, b\ can be written in the form 


A i a (a, b] 


Ac{l, 


n (-°°> ca .j] > 

j= 1 


(5.49) 


where = aj , j e A, caj = bj, j e {1,..., u} \A. The implication (10) => (7) 
then easily follows from (5.49). The equality in (5.49) can be found in Durrett 
[46] Theorem 1.1.6 page 7. 


(7) => (10) Let a be as in assertion (10). Put F = YYk=i ( —G0 ) a fc]- Then the 
subset F is closed, and since assertion (7) is equivalent to (4) we know that 
limsup a /r a (F) < fJ-(F). Since assertion (7) is equivalent to (5) we know that 


lim inf a ji a (F) ^ lim inf a //, Q 



^ [i 


F ). Since /i 



fj>(F) assertion 


(10) follows. 


The proof of these implications completes the proof Theorem 5.43. □ 
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5.44. Remark. The implication (10) => (1) in Theorem 5.43 can also be proved 
by employing the equality 


f f(x)dy a (x) = (~iy f D x - ■ ■ D„f(x) Ho 

JM*' JlR^ 


n (—°°’ x A 


j =i 


dx 


(5.50) 


r r 00 r 00 

= ••• Di ■ ■ ■ D w f (x) dx v ... dx\ dna(y), 

JR" Jy i Jy v 


where the function / is v times continuously differentiable, and where Dj de¬ 
notes differentiation with respect to the j-th coordinate, 1 ^ j ^ v. The 
equality in (5.50) can be proved by successive integration. The second equality 
is a consequence of Fubini’s theorem. 


5.45. Definition. A topological space E is called a Polish space if E possesses 
the following properties: 


(i) E is separable; 

(ii) E is metrizable; 

(iii) There exists a metric d on E that determines the topology and relative 
to which E is complete. 


Since E is metrizable property (i) is equivalent with the existence of a countable 
basis for the topology. 

5.46. Lemma. Let E be a Polish space. 

(a) A closed subset F of E is, with the induced metric, again Polish. 

(b) An open subset G of E is again Polish. 


Proof, (a) The proof of assertion (a) is not difficult. If (U 3 : j e N) is a 
countable basis for the topology of E , then (U :J n F : j e N) is a countable basis 
for the topology on F. Moreover, F is closed and hence it is complete with 
respect to the induced metric. 


(b). Let d be a metric on E, which turns E into a complete metric topological 
space. Then the open subset G is Polish for the metric do defined by 


d G (x,y) = d(x,y ) + 


1 

d (, x , G c ) 


1 

d(y,G°) 


(5.51) 


where x and y belong to G, where G c = E\G and where d (x, G c ) = inf d(x, z). 
The separability of G is also clear. The proof of Lemma 5.46 is now complete. 


□ 


5.47. Theorem. A subset A of a Polish space E is again a Polish space if and 
only if A is the countable intersection of open subsets of E. 


PROOF. Let A = P| ;6N G v where every Gj is an open subset of E. Let d be 
a metric on E, which makes E into a Polish space. Define then the metrics dj 
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on Gj , j e N, as in (5.51) and define the metric cIa on A via 

x, V )- vIJM,. 

l + dj(x,y) 

Then, endowed with the relative topology, A is a Polish space with respect to 
the (1a ■ Conversely, let d G 1 be a metric on A which is compatible with the 
topology that A inherits from E, and which turns A into a complete metric 
space. Let A be the closure of A. Then there exists a decreasing sequence 
of open subsets (G n ) nefi such that A = P| n G n ; i.e. closed subsets of E are 
Gb-subsets. For n e N, n # 0, we define the open subset A n of A as follows: 

A n = -[a: 6 A : there exists an open neighborhood U(x) in E of x 

for which d(y, z) < — for all z, y e U(x) n A 1 .. (5.52) 

n ) 

Then the following assertions about the sets A n will be proved: 

(1) For all n e N we have A a A n . 

(2) The sets A n , n e N, are open in A. 

(3) The inclusion f) n A n c= A holds, and so by (1) A = f~) n A n . 

Since, by (2), the subsets A n , neN, are open in A there exist open subsets O n , 
n 6 N, of E such that A n = O n n A, n e N. It follows that 

A = P'1 A n = |^| O n n A = O n n ^ G m = ^ O n n G n , 

n n n m n 

and hence, A is a countable intersection of open subsets of E. Next we prove 
the assertions (1), (2) and (3). 

(1) Pick x e A, and consider the ball 

Ei/( 2 n) (x) = l [ weA: d (w, x) < 

There exists an open neighborhood U(x) of x in E such that Bi/( 2n ) (x) = 
An U(x). If y, z belong to A n U (x) we have d(z, y) < d(z, x) + d(x, y) < 1/n. 
It follows that x 6 A n . This is true for all neN. 

(2) That the subset A n is open in A can be seen as follows. Pick x e A. There 
exists an open neighborhood U(x) in E of x such that d(z. y) < 1/n for all 
z, y e A n U(x). The set U(x) n A is an open neighborhood of x in A. It 
suffices to show that U(x) n A a A n . To this end choose x' e U(x) n A. Then 
U(x) is an open neighborhood in E of x' as well, and since x' belongs to A it 
follows from the definition of A n that x' is a member of A n . 

(3) Let x belong to A n for all neN. Then x belongs to A. We will prove that 
x e A. Let D < 1 be a metric on E which is compatible with its topology, and 
which turns E into complete metric space. For the moment fix n e N. Since 
x e A n there exists an open ball B n in E relative to the metric D centered at x 
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and with radius < 1/n such that 

d(z,y) < — whenever y, z e Ac\ B n . (5.53) 

n 

Since x e A there exists x n e A n B n . In this way we obtain a sequence of balls 
{B n : n e hi} relative to the metric D centered at x, which we take decreasing, 
and which are such that the D-radius of B n is strictly less than 1/n. In addition, 
we obtain a sequence of points {x n : n e N} in A such that x n e B n for all n e N. 
Since the balls B n are decreasing we have x m e B n for rri p n. From this fact and 
(5.53) it follows that d(x m ,x n ) < 1/n for m ^ n. Consequently, it follows that 
the sequence {x n : n e N} is a d-Cauchy sequence in A. Since A is d-complete 
there exists a point x' e A such d(x n ,x') < 1/n, n € N. Since D and d are 
topologically compatible on A it also follows that lim„^, x , D (x n , x r ) = 0. We 
also have hm n ^ x D (x n , x) = 0, and consequently x = x' e A. 

This completes the proof of Theorem 5.47. □ 

For a proof of the following theorem the reader is referred to the literature. We 
will give an outline of a proof. A S^-set in a topological space is a countable 
intersection of open subsets. 

5.48. Theorem. A Polish space E is homeomorphic with a Ss-subset of the 
Hilbert-cube [0,1] N , endowed with the product topology. 


> Apply now 
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Proof of Theorem 5.48. Let d : E x E —> [0,1] be a metric op E 
which is compatible with its topology, and which turns E into a Polish space, 
and let (ay : £ e N) be a countable dense subset of E. Define the mapping 
T : E —► [0,1] N by \k(a;) = (d (oyay) teN ). Then, as can be checked, the mapping 
4Ms a homeomorphism from E onto a subset of [0, l] 11 . So far we have not 
yet used the fact that E, equipped with the metric d is complete; we did use 
the fact that E is a metrizable separable space. Since E is complete with 
respect to d and is a homeomorphism, it follows that the image of E under 
T, that is A = T ( E ), is complete subspace of the Hilbert cube [0,1] N . Let 
D : [0,1] N x [0,1] N —* [0,1] be the metric defined by 

OO 

D ((^)^ eN , (%)&=n) = 2 |& — Vt\ j (<^)&=N ? (^)^eN G [O’ 1] • 

£=1 

Then D is a metric on [0,1] N which turns this space into a Polish space. It 
follows that A is a subset of [0,1] N which is homeomorphic to a Polish space, 
and so it itself is Polish. Since it is a Polish subspace of the Polish space [0,1] N , 
A is a countable intersection of open subsets of [0,1] N : see Theorem 5.47. This 
completes the proof of Theorem 5.48. □ 


The space N of positive integers with the usual metric inherited from the real 
numbers M is Polish. Then the countable product N n with metric 


OO 

^(w, w) = Yi 

i —1 


1 | m,i — nf\ 

2 * 1 + | rrii — rti 


(5.54) 


is Polish. The proofs of Propositions 5.49 and 5.50 are taken from Garrett [ 57 ]. 


5.49. Proposition. Totally order N N lexicographically. Then every closed sub¬ 
set C of N n has a least element. 


The lexicographic ordering of N 11 can be recursively defined. An element a = 
(cq, a 2 ,...) precedes an element b = (iq, b 2 ,...) if ai ^ bp, however, if Gq = b±, 
then a 2 < b 2 ; however, if oq = b\ and a 2 = 6 2 , then a 3 < b 3 , and so on. 


Proof. Let rq be the least element in N such that there is x = (rq,...) 
belonging to C. Let n 2 be the least element in N such that there is x = 
(ni,n 2 , • • •) belonging to C , and so on. Choosing the n, inductively, let :t'o = 
(ni, n 2 , n + 3,...). This oq satisfies xq < x in the lexicographic ordering for 
every x e C, and ay, belongs to the closure of C in the metric topology introduced 
in (5.54). This completes the proof of Proposition 5.49. □ 

5.50. Proposition. Let E be a Polish space. Then there exists a continuous 
surjective mapping F 0 : N n —> E. Moreover, there exists a measurable function 
Go : E —► N n such that Fq o Go(y) = y for all y e E. 

Proof. The mapping F 0 can be constructed as follows. For a given e > 0 
there is a countable covering of E by closed sets of diameter less than e. From 
this one may contrive a map F from finite sequences {?q,..., rq} in N to closed 
sets F (ni,..., rq) in E such that 
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(1 )F(0) = E; 

(2) F(m,...,n k ) = U2=1 F (m,... ,n k ;£); 

(3) The diameter of F (n i,..., n k ) is less than 2~ k . 

Then for x = {rq} e N n the sequence = F (ni,..., n k ) is a nested sequence 
of closed subsets of E with diameters less than 2~ k , respectively. Thus, the 
subset P| kEk consists of a single point F 0 (y) of E. On the other hand, every 
x e E lies inside some ("] k E k . Continuity is easy to verify. The mapping Go 
can be constructed as follows. The space N N , endowed with the lexicographical 
ordering is totally ordered, and by Proposition 5.49 every closed subset contains 
a least element for this order. For y e E the subset Ff 1 (y) is closed in N n , and 
therefore it contains a least element Go(y). This assignment is a measurable 
choice (because it can be performed in countably many steps). Then F 0 oGo(y) = 
y for y e E. The proof of Proposition 5.50 is complete now. □ 

The proof of the following theorem is based on the fact that a Polish space is 
homeomorphic with a S^-subset of the Hilbert cube, which, being a countable 
product of closed intervals, is a compact metrizable space. 

5.51. Theorem. Let y be a finite positive measure on the Borel field of a Polish 
space. Then y is regular in the sense that 

y(B) = inf {y(0) : O open, B c= 0} = sup {y{K) : K compact, K a B}. 

(5.55) 

PROOF. Let T and A be as in the proof of Theorem 5.48. Then 4/ : E —* A 
is a homeomorphism. Let y ^ 0 be a finite measure on the Borel held of E. 
Define the measure u on the Borel held of [0,1] N by 

u(B) = y [T '(Bn H)] = y [T e B n A] , B Borel subset of [0,1] N . (5.56) 

Then, since the Hilbert cube is compact and complete metrizable, and A is a 
Ss-subset of the Hilbert cube, we see that the measure v is regular on the Borel 
held of [0,1] N . It also follows that the restriction of v to the Borel held of A 
is regular. However, under the homeomorphism T : E —> A the Borel subsets 
of E are in a one-to-one correspondence with those of A. It easily follows that 
the measure y is regular in the sense of (5.55), which completes the proof of 
Theorem 5.51. □ 

5.52. Theorem. The following assertions hold for Banach spaces. 

(a) Let E be a separable Banach space. Then its dual unit ball B', endowed 
with the weak*-topology, is a Polish space. 

(b) (Helly) The set is compact-metrizable, and thus Polish for the 

vague topology. 

Let E' be the topological dual space of E. The weak*-topology is denoted by 
a:=a(E',E). 

Proof, (a) Let (x n : n e N) be a sequence in the unit ball B of E of which 
the linear span is dense in E. Dehne the mapping $ : (B r , a) —> [0,1] N via the 
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map x' i—> (< x n ,x' >)“ =1 . The mapping $ is continuous and, by the Theorem of 
Banach-Alaoglu, the dual unit ball B' is compact relative the topology a ( EE). 
It follows that is a compact subset of [0,1] N . This image is Polish, and 

because the inverse of $ is continuous, B' itself is Polish as well. 

Let r be a positive real number. Let the set M^ r be defined by 

M+ r = {fj, e M+ : // (IT) E r} , 

and let M+ be given by 

M+ = {|ieM +:ijl (R") = r} . 

(b) Since is a vaguely closed subset of M = Co (IT)*, by the Theorem of 
Banach-Alaoglu it follows that is compact for the vague topology. The 
fact that the set is Polish will be proved in Theorem 5.54. This completes 
the proof of Theorem 5.52. □ 

5.53. Definition. A subset A of M is a Prohorov subset, if it satisfies the 
following two conditions: 

(a) sup^ |/i| (IT) is finite; 

(b) For every e > 0 there exists a compact subset K of IT such that 

M (K"VC ss f 

for all measures fi e A. 
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5.54. Theorem, (a) The spaces and are Polish with respect to the 
vague topology. 

(b) The spaces and M] 1 " are Polish with respect to the weak topology. 


Proof. The countable collection 


oo f n n 

LJ j X a 3^ x j : a 3 G ^ X <y 'i = 1 

n=i vi=i j=i 

is dense in for the vague as well as for the weak topology. The countable 
collection 

oo f n n 

U ) Sj : otj e Q, a ^ 0, 0 < X otj 1 
n=1 vi=i j=i 

is dense in for de vague as well as the weak topology. This can be seen 
as follows. Let / be a bounded continuous function defined op l". Fix e > 0, 
and choose a partition of M" in Borel subsets (Aj : j e N) in such a way that 


I f{x) - f(y) | < 


p(R u ) 

choose N e N so large that 


p i 


for all x, y e Aj, and this for all j e N. In addition, 


N 


') - 2> M 

i=i 


< e. 


Put aj = p (Aj) and choose x 3 e Aj. Then we have 


I 




fdp - 


Zf=i %/( 


Xa 




oo r 

51 


+ 


Zjj=i a i 

(./'(• r ) - /(•'j)) d K x 

00 r* 

X f( X ) d " 

i— AT-L 1 


+ 


AT 


2 a jf( x j) 1 - 
t=i 


//(R") 

s~^\N 

/ jj —i 


j=AT+l 

oo 


AT 




^ ( R(/ ) 


X! d (A?) + 2 i/iioo i i i ( RI/ ) - X ^ ( a j) r ^ 3e - 


3 =1 


Appealing another time to the continuity of the function /, and using the fact 
that the rational numbers are dense in M we obtain the separability of the 
sets lYt+ and Mj r relative tot the vague as well as the weak topology. We 
indicate metrics which turn these spaces into Polish spaces. Therefore we choose 
a sequence of functions (fi- : k e N) in {/ e Coo (R") : 0 < / < 1} whose linear 
span is uniformly dense in Co (M' y ). We also choose a sequence (v/,/- : t e N) in 
Coo (R v ) such that 1 ^ u ^ + 1 5= ^ 0 and such that 1 = lim .£^. aa ug(x) for all 
x 6 M". 
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(a) Define the distance d v on via the formula 


00 

d v = Yi 

3 = 1 

where // and n are members of M j r . Supplied with this metric the space 
is Polish for the vague topology. The spaces 3VLt r , 0 < r < 1, are also closed for 
the vague topology. Since 

00 

K = fl (5-58) 

n=l 


1 

¥ 


j fjd[i - J 


fjdu 


(5.57) 


we see that M] 1 " is a Polish space: see the Theorems 5.47 and 5.52. 


(b) Let /o = 1 and let the sequences (fk'.ke N) and (ug : £ e N) be as above. 
We define the metric d w by the equality 


dyAji, v) = sup 





(5.59) 


where /.i and v belong to 3VfJ r . Supplied with this metric the space M^ r is Polish 
for the weak topology. If we are also able to prove that the space M^ r is complete 
relative to the metric d w , then it follows that the spaces M^ r , r > 0, are Polish. 
These spaces are also weakly closed. By the equality in (5.58) in Theorem 5.47 
then implies that is a Polish space as well. Now let (fi m : m e N) be a d w - 
Cauchy sequence in Mj r . We will prove that this sequence is a Prohorov set in 
M^ r . Choose e > 0 arbitrary. Then there exists M e e N such that for m and 
m' M e the inequality 


sup 

•feN 


J Uld/Jm ~ J Ugd/J, m > 


e 

< - 
4 


holds. So it follows that 


l/bn (R 17 ) - AW 0R")| < | } m, m! ^ M e 


From (5.60) and (5.61) it follows that 


sup 

£eN 


J (1 - Uf) dn m - J (1 - Ug) dll. 


e 

m' ^ 2’ 


(5.60) 


(5.61) 


(5.62) 


for m and m r ^ M e . Then from (5.62) it follows that 


J (1 -ug)dn m ^ J (1 -u e )d/j, Me + (5.63) 

From (5.63) it then follows that for £ 5= £ e and rn ^ M e the following inequality 
holds: 

j + 5 = f. (5.64) 


337 


Download free eBooks at bookboon.com 












Advanced stochastic processes: Part II 


Some related results 


From (5.64) it then follows that for all t ^ L e and for all m e N the inequality 
(1 - u e ) dfi m < e. (5.65) 


J' 


holds. Then choose the compact subset K e equal the support of the function 
ul v It follows that /i m (M. v \K e ) ^ e for all m e N. Let fi be the vague limit of 
the sequence (fi m : m e N). This limit exists, because is compact for the 
metric d v . Then pick a function u e Coo (M") in such a way that 1 ^ u ^ 1 K e - 
By the equality 


J fdfi m - J fdfi = J (/ - fu) dfj, m + J 

we infer the inequality 

J fd^m - J fdfi 


fudfi, 


- J fudfi - J (/ - fu) dfi 


< 


J (1 u) dfi m + (1 — idj d/i ^ ~\~ I fudfi m I fud/i 


J fnd,i m - I fud/i 


■ l/L (p m (K-\A'«) + H (K-\A'«)) + 

« ll/lloo (R “\K t ) + supyn™ (R-\if«)l + J 


fud/M. 


1 


fudfj 


< 2e 


fudfL m J 


fudfi 


(5.66) 


Since fi = vague- lim m fi m from (5.66) it also follows that // is the weak limit of 
the sequence (fi m \ me N). 


The proof Theorem 5.54 is now complete. 


□ 


Part of the proof of Theorem 5.54 comes back in the proof of Theorem 5.55. 

5.55. Theorem. A subset S of is relatively weakly compact if and only if 
S is a Prohorov subset. 


PROOF. First, suppose that S' is a Prohorov subset of M^ r . Let ( fi m :meN) 
be a sequence in S. We will prove that there exists a subsequence that con¬ 
verges weakly. We may assume that, possibly by passing to a subsequence, 
that this sequence converges vaguely. By employing the Prohorov property we 
will show that, in fact, this sequence converges weakly. Let e > 0 be arbitrary. 
Then there exists a compact subset K such that fi m ( K c ) < e and also that 
ft ( K c ) < e. Then choose u e Coo QR 1 ') in such a way that 1 ^ u ^ 1^. Then we 
have (see the final part of the proof of Theorem 5.54): 



(1 — u 




fudfi 
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J - I fudp 


< ll/IL it*™ (R“\if) + it (r"\a:)) + 

7 ll/IL (/',„ IE" A'] + sup;i m fP:'' A/) + J fndii m - J fndji 


< 2e 


J /«<L™ - I fud/i 


(5.67) 


Since // = vague- lim m //,,, from (5.67) it also follows that /i is the weak limit of 
the sequence (// m \ me N). 

Conversely, suppose that the set S' is weakly compact. We will prove that S 
possesses the Prohorov property. Assume the contrary. Then there exists an 
77 > 0 and there exists an increasing sequence of compact subsets ( K m :meN) 
with the following properties: 


(a) K m a K m+ i and W = (J ® =1 K m ] 

(b) For every meN there exists a measure fi m e S such that fi m {Kff) ^ r/. 


Then choose a sequence of functions (u m e Coo (®C)) suc h that 1 K m A u m+ 1 < 
From (b) it follows then that, for m ^ 7, 

V J (1 - 1 K m ) dn m < J (1 - l Ke ) dfi m < J (1 - U() dn m . (5.68) 

Since the subset S is relatively weakly compact, there exists a subsequence 
(hm fc : k e N) which converges weakly to a measure //. From (5.68) we then 
see that r/ < S(1 — U £) dfi for all £ e N. From this we obtain a contradiction, 
because the sequence (1 — ug : 7 e N) decreases to 0. 

This completes the proof of Theorem 5.55. □ 


5.56. Corollary. Let (// m : m eN) be a sequence of measures in 3VC + and let 
H also belong to M + . Choose a sequence of functions ( fk'.ke N) in 

{/ e Coo (IT) : 0 < / < 1} 

with a linear span that is uniformly dense in Cq (M") and choose another se¬ 
quence (ii£ : £ s N) in Coo (®C) such that 1 ^ rq+i ^ U£ ^ 0 and such that 
1 = lim^oo ui(x) for all x e R". Define the metric d w by the equality 


d w (u 1 ,v 2 ) = sup 

teN 


I 


U(du 2 




1 


where zq and v 2 belong to M + . Then the following assertions are equivalent: 


(i) The sequence (// m :meN) converges weakly to /.// 

(ii) lim m ^oo d w (// m , p) = 0 . 


Proof. The proof is left as an exercise for the reader. 


□ 
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3. A taste of ergodic theory 

In this section we will formulate and prove the pointwise ergodic theorem of 
Birkhoff. We also indicate its relation with the strong law of large numbers. 
We will also show that the strong law of large numbers (SLLN) implies the 
weak law of large numbers (WLLN). However, we begin with von Neumann’a 
ergodic theorem in a Hilbert space. In what follows the symbol H stands for 
a (complex) Hilbert space with inner-product (•, •) and norm ||-||. An operator 
U : H —> H is called unitary if it satisfies U*U = UU* = I. An operator 
P : H —*■ H is called an orthogonal projection if P* = P = P 2 . Let L be closed 
subspace of H. Then H can be written as 

H = L © L l = PH + (/ - P) H, 

where P : H —* H is an orthogonal projection with range L. The following 
theorem is the same as Theorem 7.1 in Romik [115]. 
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5.57. Theorem. Let H be a Hilbert space, and let U be a unitary operator on 
H. Let P be the orthogonal projection operator onto the subspace N (U — I) 
(the subspace of H consisting of U-invariant vectors). For any vector v e H the 
equality 

-t n—1 

lim -Yu k v = Pv. 

oo n “f 
k —0 


holds. (Equivalently, the sequence of operators 
to P in the strong operator topology.) 



n—1 


k =0 


neN 


converges 


Proof. Define the subspace V a H by V = N(U—I) = {v e H : Uv = v} = 
N (U* — I). Then 

V = ( R(U - I)) 1 := {v 6 H : ((U - I) u,v) = 0 for all u e H) , (5.69) 

where R(U — L) is the range of the operator U — /, i.e. 

R(U — I) = {Uu — u : ue H} . 


Let L be any linear subspace of H. From Hilbert space techniques it is known 
that ( L x ) coincides with the closure of the linear subspace L. From these 
observations it follows that the subspace N (U — /) + R(U — /) is dense in H, 
and that the subspaces N (U — I) and R{U — I) are orthogonal. Moreover, we 
have 


n—1 


n 


XT* 

k =0 


^ - 
n 


n—1 

- X IF' 

n 11 


u < ||u|| , V G H. 


k =0 


Define the subspace L a H by 


L = \ ve H : lim - V U k v = Pv 1 , 

n^oc n 

V ^=0 J 


(5.70) 


(5.71) 


where P is the orthogonal projection onto the space V = N(U —/). From (5.70) 
it follows that L is a closed subspace of H. If v e V, i.e. if Uv = v, then v 
belongs to L. If v = (U — I) u belongs to the range of U — /, then 


n—1 n—1 

- V u k v = - V U k (U - I) u = - (U n - I) u, 
n f-i n f-j n 

k= 0 /c=0 


(5.72) 


and so by (5.70) and (5.71) it follows that 


n—L 

lim - V u k v = 0 = Pv, (5.73) 

77—>00 n 

k —0 


whenever v belongs to R(U — /). Again appealing to (5.70) shows that (5.73) 
also holds for v belonging to the closure of R(U — /). Altogether it shows that 
the snbspace L coincides with H. This completes the proof of Theorem 5.57. □ 
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Let (£7, T. P; T ) be a measure preserving system. We associate with the measure 
preserving mapping T : £7 —*■ £7 an operator Ut on the Hilbert space L 2 (£7,T, P) 
defined by Ux(f) = f°T , /ei 2 (£7, T, P). The fact that T is measure preserving 
implies that U^Ut = /: 

{UtS, U T g) - E [U T fTbii\ - E [/ o Tj^T\ = E [(fg) o T] - E [fg] - </, g). 

(5.74) 

From (5.74) it follows that U^Ut = /. In order that Ut is a unitary operator 
it should also be surjective. Since the range of Ut is closed, this is the case 
provided that the set of functions / o T, f e U 2 (£7. T, P), constitutes a dense 
subspace of L 2 (£7, T, P). The latter is true if the mapping T has the property 
that there exists a measurable mapping T : £7 —» £7 such that T o T(u) = oj for 
P-almost all c u. Then the operator U defined by Uf = f o T, / e L 2 (£7,T, P) 
is the adjoint of Ut- This can be seen as follows. Now we do not only have 
U*U T = /, but we also have U T Uf = Uf o T = f o f o T = f, f e L 2 (£7, T, P). 
Hence, we see 

U = (U^Ut) U = U£ (u t u\ = U£. 

Note also that the subspace N(U — I ) consists exactly of the invariant (square- 
integrable) random variables, or equivalently those random variables which are 
measurable with respect to the cx-algebra J of invariant events. Recalling the 
discussion of conditional expectations in Theorem 1.4, item (11), in Chapter 1, 
we also see that the orthogonal projection operator P is exactly the conditional 
expectation operator £ [• | U] with respect to the cr-algebra of invariant events 
3. Thus, Theorem 5.57 applied to this setting gives the following result. 

5.58. Theorem (The L 2 ergodic theorem). Let (£7,T,P;T) be a measure pre¬ 
serving system. For any random variable X e L 2 (£7, T, P) the equality 

-t 77—1 

lim - V X o T k = E \X I Jl (5.75) 

77 — >00 Tl ^ L I J 

0 

holds in L 2 (£7, T, P). In particular, if the system is ergodic then 

77—1 

L 2 - lim - V XoT k = E[X ]. (5.76) 

77 *00 Ti “f 

k—0 

Since the operator S : L 2 (£7, T, P) —► L 2 (£7,3 r , P), defined by Sf = f o T, 
/ 6 L 2 (£7,T, P), is not necessarily unitary, Theorem 5.58 requires a proof. It 
only satisfies S*S = I. The proof below is based on the proof of Theorem 5.66 
below. 


PROOF. Theorem 5.58 is a consequence of Theorem 5.59 which includes the 
Id-version of Theorem 5.58. More precisely, we have to prove that 


lim E 

77—>00 


-i 77—1 

-L x °T k - Eprp] 

k —0 


0 . 


(5.77) 
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Let X e L 2 (f2,T, P) be bounded. Then we can use Theorem 5.59 together 
Lebesgue’s theorem of dominated convergence that the equality in (5.77) holds 
for X. A general function X e L 2 (Q, T. P) can be approximated by bounded 
functions in Id-sense. Since $ |/ o T | 2 d/j, = ^ \ f\ 2 dfi , and so the convergence in 
(5.77) also holds for all Id-functions. The precise argument follows as in (5.122) 
below with Sf = f o T, f e Id (0,T, P), and relative to the Id-norm instead 
of the Id-norm. So let / belong to L 2 (12, T, P), and let ill > 0 be an arbitrary 
real number. Then we have: 

71—1 



(/1{|/|<M}) 


+ 


L 2 


\ k—0 


(/l{|/>Af}) 


L 2 


- h sk - P A (fU\f\<m) 


1/2 


dP 


+ 2 


I 


i/r dp 


ll/l >M} 


1/2 


(5.78) 


As in the proof of Theorem 5.66 in (5.78) we first let n 
to obtain the Id-convergence in Theorem 5.58. 


oo, and then M 


oo 

□ 
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The pointwise ergodic theorem of Birkhoff requires some more work. In what 
follows (Q, T, //) is positive measure space, and T : Cl —*• Cl is a measure preserv¬ 
ing mapping, i.e. g{T e A} = g {T _1 A} = g {A} for all A e T with g {A} < co. 
An equivalent formulation reads as follows. For all / e L 1 (Cl, T, g) the equality 
\ f oT dg = ^ f dg holds. In other words the quadruple (Cl, T, P; T) is a measure 
preserving system, or dynamical system. The operator P t , in (5.80) is a projec¬ 
tion mapping from L 1 (Cl, T, /i) onto a space consisting of T-invariant functions. 
Hence a function of the form g = P llf f satisfies g o T = g //-almost everywhere, 
and so g is measurable with respect to the invariant cr-field J. In addition, if h 
is a bounded, T-invariant function in L 1 (Q, T. g), then we have 

J (T m /) hdg = J (//i) dg = J fhdg. (5.79) 

In other words the function P^f is the //-conditional expectation of the function 
/ on the cr-field of invariant subsets. A measure preserving system (Q, A, //; T ) 
is called ergodic if a T-invariant function is constant //-almost every, and so the 
cr-field J is trivial, i.e. 0 consists, up to sets of //-measure zero, of the void set 
and of Cl. 


5.59. Theorem (The pointwise ergodic theorem in L 1 ). Let (fl, T, //;T) be a 
measure preserving system. For any function f e L 1 (fl, T, g) the equality 


lim — 

n—KX) Jl 


n— 1 


^/oT fc 

k =0 


P,f 


(5.80) 


holds g-almost everywhere. In particular, if the system is ergodic then the equal¬ 
ity 


lim — 

h—> oo n 


i n_1 r 

-Y A foT k = \ 

n to J 


f dg. 


(5.81) 


holds g-almost everywhere. If g is a probability measure, then the limits in 
(5.80) and (5.81) also hold in L 1 -sense, and P^f = [/ | 9]. 


PROOF. The proof of Theorem 5.59 follows from Theorem 5.66 and its Corol¬ 
lary 5.67 with g (Cl) = 1, and Sf = foT,feL 1 (f1, T, //). □ 


Let (SdoiTojP) be a probability space, and let Xj : —* P, j = 0, 1, ..., be 

a sequence of independent and identically distributed variables (i.i.d.). Let us 
show that the SLLN is a consequence: see Theorem 2.54. Put S n = YjkZ o AT. 


5.60. Theorem (Strong law of large numbers). The equality 


v S n 

lim — = a, 
n —>go n 


holds P -almost surely 


for some finite constant a, if and only if E [|X^|] < oo, and then a 


EM- 


PROOF. Let Cl = M N , endowed with the product cr-field 1 = (x)J =0 23 ; where 
23 j is the Borel field on M. Define the probability measure g on T by 

g(A) = E [1 A (X 0 ,X U ...)] =F[(X 0 ,X U ...) e A], A e T. 


344 


Download free eBooks at bookboon.com 


Advanced stochastic processes: Part II 


Some related results 


Put Sf (x 0 ,xi,...) = f (xi,X 2 , ■ ■ ■ ), f 6 L 1 (ft, S', p), (x 0 ,xi,...) e ft. Then 
^Sf dp = J / dp, / e T 1 (ft,S)/r). The assertion in Theorem 5.60 follows from 
Theorem 5.66 and its Corollary 5.67 by applying them to the function /o : ft —» 
M defined by /o (xo, x\,.. .) = xo, (xo, xi,...) e ft. Then 

71 — 1 71—1 71—1 

(, S k f 0 ) (Xo,^, X 2 , ...) = 2 / 0 (X k , X k+1 ,...) = X k , k = 0, 1, ..., 

fc=0 k —0 /c=0 

and hence Theorem 5.60 is a consequence of Theorem 5.66 and its Corollary 5.67. 
Theorem 5.60 also follows from Theorem 5.59 by applying it to the mapping 
T : ft —► ft given by T (xo, xi,...) = (xi, x 2 ,...), (xo, xi,...) e ft. □ 

We will formulate some of the results in terms of positivity preserving operators 
S : L 1 (ft, S', p) —> L 1 (ft, S, p). 

5.61. Lemma. Let S : L 1 (ft, T, /i) —> L 1 (ft, T, //) 6e a linear map, and let f ^ 0 
belong to L 1 (ft, T,/r). Then the following assertions are equivalent: 

(i) min (5/, 1) = 5 (min (/, 1)); 

(ii) max (5 1 / — 1, 0) = S (max (/ — 1,0)); 

(iii) min (S'/, 1) < S (min (/, 1)), and max (Sf — 1, 0) = S (max (/ — 1,0)). 

Suppose that for every / > 0, / e L 1 (ft, T, /r) the operator S satisfies one, and 
hence all of the conditions (i), (ii) and (iii). In addition, assume that 

f dp, for all f s L 1 (ft, T, p), / > 0. (5.82) 

Then S is positivity preserving in the sense that / ^ 0, / e L 1 (ft, T, p), implies 
Sf ^ 0, and contractive in the sense that $|S/| dp ^ J|/| dp for all f e 
L 1 (ft, T, /i). Then the equivalent conditions (iv), (v), (vi). and (vii) given by 

(iv) The equality S ( fg ) = (Sf) ( Sg) holds for all /eT (ft,//i), and for 
all g c L 1 (ft, T, p) f) T 00 (ft, T, p), 

(v) S (min (/, g)) = min (S/, Sg) is true for all f, g e L 1 (ft, T, p), 

(vi) The equality S (max (/ — 1,0)) = max (S/ — 1,0) holds for every f e 

V (ft.T.//). 

(vii) The Sl{/>i} = l{s/>i} holds for every f e L 1 (Q,T,p). 

are also true. Moreover, (vi) implies that if the assertions (i), (ii) and (iii) 
are true for all positive functions f in L 1 (ft, T, p), then they are true for all 
functions f in L 1 (ft, S', p). If the measure p is finite, then all assertions (i), 
(ii), (iii) (for all f ^ 0, / e L 1 (ft, T, p),) and (iv), (v), (vi) and (vii) are 
equivalent. Finally, the operator S : L 1 (ft, S', p) —► L 1 (ft, S', p) is continuous, 
more precisely, 

J \Sf\ dp = J S |/| dp < f |/| dp, feL 1 (ft, S, p). (5.83) 

5.62. Remark. Assertion (iv) also holds for all /, g e L 1 (ft, S', p) f) L 2 (ft, S', p). 
The equality in (v) can be replaced with 

S (max (f,g)) = max (Sf, Sg) for all /, g e L 1 (ft, S', p). (5.84) 


J \Sf\ dp ^ j 
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The latter is true because min(/, g) + max(/, <?) = / + g. 

PROOF of Lemma 5.61. The equivalence of the assertions (i), (ii) and (iii) 
follows from the following identities: 

min (Sf, 1) + S (max (/ — 1, 0)) = Sf = min ( Sf , 1) + max (Sf —1,0). 

Now assume that for all / ^ 0, the operator S satisfies (i), (ii) or (iii), and 
assume that S is contractive in the sense of (5.82). Let / ^ 0 belong to 
L 1 (Q, T, p), and put f n = max (/ — n~ l , 0). Then the sequence (/ — f n ) n de¬ 
creases to 0, and hence, by (5.82), lim n ^oo $ \Sf — Sf n \ dg = 0. Then there 
exists a subsequence (/ nfc ) fc such that the sequence ( Sf nk ) k converges to Sf p- 
almost everywhere. Hence Sf ^ 0 /.(-almost everywhere. In fact it follows that 
the sequence ( Sf n ) n increases to Sf //-almost everywhere. Let /ei 1 (f2,T, /i), 
and write / = /+ — /_ where /+ = max (/, 0), and /_ = max(—/, 0). Then 
|/| =/ + — /_, and \Sf\ < Sf + + Sf-. From (5.82) it the follows that 

J \Sf\ dji J (57+ + Sf-) dfi = f 5 (/+ + /_) d// = J 5 |/| dp < J I/I d/i. 

(5.85) 
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The inequalities in (5.85) prove the contraction property of the operator S. Next 
we prove the assertions (iv), (v) and (vi) starting from (i), (ii) or (iii) for all / 6 
L 1 (fi, T, fj), f ^ 0. Let the function / > 0 belong to L 1 (fi, T, //) P L 2 (fi, T, /r). 
Then we write 


f 


pOO 

2 max (/ — a, 0) da 

Jo 


n2 n 

sup 2~ n+1 2 max (/ - j2~ n , 0) , 


ad so 


s(f 2 ) 


p 00 

2 S' max (/ — a, 0) da 

Jo 


n2 n 

sup 2~ n+1 2 S (max (/ - 0)) 

n 


(apply assertion (ii)) 


n2 n 

= sup 2 _n+1 ^ max (A/ — j2~ n , 0) 
n j =i 

pOO 

= 2 max (S'/ — cr, 0) da = (S/) 2 . (5.86) 

Jo 

The equality in (5.86) shows that the assertion in (iv) is true provided that 
/ = g ^ 0. For general / = g we split / in its positive and negative part. 
For general / and g belonging to L 1 (f2, T, /i) (~) L 2 (f2, T, /i) we write 2 fg = 
(/ + g) 2 — f 2 — g 2 ■ Altogether this shows assertion (iv). (iv) => (v) Let / belong 
to L 1 (Q, T, p) P| L 2 (fi, T, p). Then we write 


fi = 2 r /2 

Jl vr J 0 t 2 + / 2 




and so by assertion (iv) we get 



(for explanation see below: equality (5.89)) 


= 2 r s (/ 2 ) 

vrJo i 2 + S(/ 2 ) 

((iv) implies S (/ 2 ) = (S/) 2 ) 



(Sff 
t 2 + (Sff 


dt = |S/|. 


(5.87) 


The equality in (5.87) shows that (5.84) is true for g = —f. For general 
f, ge L 1 (fi, T, //,) P L 2 (fi, 3\ p we write 2 max (/, g) = \f - g\ + / + g. Conse¬ 
quently, assertion (v) follows for /, g e L 1 (fi, T, //) P L 2 (fi, T, //). If / and g are 
arbitrary functions in L 1 (fi, T, p, then we approximate them by f n : = 
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and by g n := gl{| 5 |^n} respectively. This shows that assertion (v) is a conse¬ 
quence of (iv), except that the proof of the second equality in (5.87) is not 
provided yet. In order to prove this equality it suffices to prove that, for a > 0 
and g >0 ,ge L 1 (f2, T, /i) the equaiity 


sM- 

[a + g 

hoids. By assertion (iv) we have 


Sg 

a -\- Sg 


S 


9 

a + g 


(a + Sg) 


S 


= S 


ag ) 

\ +s \ 

f 9 

a + g J 

[a + g 

ag ) 

Hi 

f S 2 

a + g J 

[a + g 


Sg 

= Sg. 


(5.88) 


(5.89) 


The equality in (5.89) shows the validity of (5.88). Therefore the second equality 
in (5.87) is proved now. 


(ii) plus (v) => (vi) We apply (5.84), which is equivalent to (v), with / e 
L 1 (Cl, £F, /i) arbitrary and g = 0 to obtain 

S (max (/ — 1, 0)) = S (max (max (/, 0) — 1, 0)) 


(employ assertion (ii)) 

= max (S max (/, 0) — 1, 0) 

(apply assertion (v)) 

= max (max (Sf, SO) — 1,0) 

= max (max (Sf, 0) — 1,0) = max (Sf — 1,0). 

Hence, assertion (vi) follows from (ii) and (v). 

(vi) =^> (v) Let /eL 1 (Cl, SF, g). By assertion (vi) we have 

S max (/, 0) = lim S max (/ — e, 0) = limmax (Sf — e, 0) = max (Sf, 0). 

(5.90) 

Whence, S max (/, 0) = max (Sf, 0). Since |/| = 2 max (/, 0) — / we easily infer 
S \f\ = |*S'/|, and assertion (v) follows: see the proof of the implication (iv) => 

( v )- 

(ii) plus (v) (iv) Let / belong to L 1 (f2, T, /i) f) L 2 (0, T, /i). Then we write 

rCO 

f 2 = 2 max (|/| — a, 0) da, 

Jo 

and so by assertion (ii) and (v) we get 

rGO 

S (f 2 ) = 2 max (S \ f\ — a, 0) da 

Jo 

rOO 

= 2 max(|S'/! — a,0 ) da 

Jo 

= \Sf\ 2 = (Sf) 2 , 
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and hence (iv) follows, (vi) => (vii) Let / belong to L l (f2,T,/r). Then 1 {/>i} is 
//-hitcgraide as well. Then we have 

Sl//>n = lim S (min (ramax (/ — 1, 0))) 

m—>00 

= lim (min (mmax (S'/ — 1, 0))) = 1 { 5 '/ > i}. (5.91) 

m—KX) 

The equality of the ultimate terms in (5.91) proves the implication (vi) => (vii). 

(vii) => (vi) Let / belong to L 1 (O, T, //). Then the functions 1 {/>«}, cr > 0, are 
/i-integrable as well. We have 

rCC rCC 

S (max (/ — 1, 0)) = S l {/ _i> a} da = Sl {f _ 1>a] da 

Jo Jo 

r*oo 

= l{ S f-i>a}da = max (Sf - 1,0). (5.92) 

Jo 

The equality of the ultimate terms in (5.92) proves the implication (vii) => (vi). 

If the measure /i is finite, then the constant functions belong to L 1 (0. T, //). 
Since SI = 1, it is easy to see that assertion (iv) implies assertion (i), and hence 
by what is proved above, we see that for a finite measure // all assertion (i) 
through (vi) are equivalent. The equality and inequality in (5.83) follow from 
assertion (v) and the inequality in (5.82). This completes the proof of Lemma 
5.61. □ 
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5.63. Proposition. Let g and h be functions in L 1 (f), T, g). Define, for —oo < 
a < b < co, the subset Cby 

C a,b h} = {g< a <b<h}. (5.93) 

Then the following equality holds: 

Sl„{g,h} = 1 n {Sg,Sh}. (5.94) 

° a,b °a,6 

Proof. Since = {g < a < b < h} we may assume that b > 0. If 

a < 0, then 1 { g<a <b<h} = l{- g >-a}l{h>b}- From assertions (iv) and (vii) of 
Lemma 5.61 it follows that 

51 {g<a<b<h} 1{— Sg>— a} l{S7i>6} 1 {Sg<a<b<Sh}i 

and consequently (5.94) follows for a < 0 < b. If a = 0, then we replace a with 
a — e and let e j 0. If a > 0, then we consider, for 0 < z < a, 

l{g<a— £<b<h} = ^-{h>b} ^-{g>a—e}^-{h>b} ■ (5.95) 

Another application of the assertions (iv) and (vii) of Lemma 5.61 then yields 
by employing (5.95) the equality: 

51 {g^a—e<b<h} = 1 {Sh>b} l{S'g>a—e} l{S7i.>6} = ^-{Sg^a— e<b<Sh] • (5.96) 

In (5.96) we let e \ 0 to obtain the equality in (5.93) for 0 < a < b. This 
completes the proof of Proposition 5.63. □ 


We also need the following proposition. It will be used with g n = h n of the form 

n —1 

h n = - V S k f where / el 1 (Q, T, g). 

n f—l 


5.64. Proposition. Let {g n } n and {h n } n be sequences in L 1 (fl,9 r , g) with the 
property that for every c > 0 the subsets {sup n \g n \ > c} and {sup n \h n \ > c } have 
finite g-measure. Define, for —oo < a < b < co, the subset (j^ n ^ hn ^ n by 


/^l{9n} n ,{h n } n 


lim inf g n < a < b < lim sup h, 


Then the following equality holds: 


(5.97) 


SI 


^{9n } n ,{^n } n 


= 1 


r {Sg„},{Sh„} n . 
° a,6 


(5.98) 


Proof. We write the function l„{g n } n ,{hn}„ as follows: 

a a ,b 


1 


9n}n’{hn} n 

U a,b 


= sup inf sup inf min max 

N 1 N[ N2 N 2 Ni^m^N[ N 2 ^n 2 ^N 2 


If 

\9ni 


<a<b<h n 


(5.99) 


where the suprema and infima are monotone limit operations in L 1 (O. T, g). 
An appeal to assertion (v) in Lemma 5.61, to (5.83), and to (5.84) the equality 
in (5.99) implies 


51 


^{9n } n ,{^n } n 
U a,b 


sup inf sup inf min max 

Ni N [ N 2 N 2 Ni^ni^N[ N 2 ^n 2 ^N 2 


SI 


[gn 1 <a<b<h ri2 y 


(5.100) 
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The equality in (5.96) in combination with (5.100) then shows 




^{9n} n dhn} n 


sup inf sup inf 

jvi K n 2 K 


nun max i/~ \ 

N 2 <n 2 <N^ f ;: ’ 9n i <a<0< ' bftn 2 J 


1 


^i{Sgn} n ,{Sh n }n • 
°a,6 


(5.101) 


The equality in (5.101) completes the proof of Proposition 5.64. 


□ 


5.65. Theorem (Maximal ergodic theorem). Let S : L 1 (Q, T, p) —*• L 1 (ft, T, //) 
6e a linear map, which is positivity preserving and contractive. So that f e 
L 1 (ft, £F, /i) and / > 0 implies Sf ^ 0 and ||<S'/|| 1 < ||/||i- Define for f e 
L 1 (ft,£F, /i) the to S corresponding maximal function f by 

n— 1 

/ = sup-V S*/- (5.102) 

neN W 

Then the following assertions are valid: 


(a) If f belongs to L 1 (ft, Sq p), then f f dp ^ 0; 

J{/>o} 

(b) // in addition, min (S'/, 1) = S(min(/, 1)), for all f e L 1 (LI, T, p), 
f 5= 0, then for any a > 0 and any f e L 1 (ft,T,/i), the following 
inequalities hold: 


p {Sf > a] < p {/ > a }, and ap j/ > a j 


< 


in • 


(5.103) 


Observe that the second inequality in (5.103) resembles the Doob’s maximal 
inequality for sub-martingales: see Theorem 5.110 or Proposition 3.107. 


Proof, (a) Let / e L 1 (ft, T, p), and define, for n a positive integer, the 
function h n by 

k 


hn. 


max max ( 0, V* S- 7 / I . 

0sSfcsSn-l \ fr! J I 


(5.104) 


3=0 


Then we have h n+ 1 5= h n ^ 0, and for u> e Cl such that h n+ i(cj) > 0, we have 
Sh n (uj) + f(uj) ^ n+ i (a;). The latter inequality is a consequence of the inequality 


/ + Sh n = f + S ( max max ( 0, V* S- 7 / 

>/+ o^i 5 ( maX 

> o max ^ (max | / + SO, / + S S J+1 / 


i=0 


= max I max 

O^k^n—l 


k+1 

/. 2 sy 

J—o 


= max 


VJ =0 


(5.105) 
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From (5.105) it readily follows that 

/ + Sh n ^ h n+ 1 on {h n+ i > 0} . (5.106) 


Notice that in the arguments leading to h n+ i ^ h n , and also to (5.105), we 
employed the fact that g ^ 0, g e L 1 (fl, T, g), implies Sg ^ 0. From (5.106) we 
infer 


f ./' dg ^ 

{J^n+1 Sh n ) d/i 



J{h n+ 1 >0} J 

{h n+ 1>0} 



= | 

h n +\ d/i — 

Sh n dg 


J 

{^-n+l>0} ^ {hn+ 1>0} 


>J 

| h n+ 1 dg, — J Sh n dg ^ J 

h n+ 1 dg — J h n dg 



j' (tin+i ^n) dg ^ 0. 


(5.107) 

From (5.107) we obtain 




f fdg = f 

f dg = lim 

f dg ^ 0. 

(5.108) 

J{/>0} J\j™= 0 {h n+ i>0} n ^°°J{fe n+1 

>0} 



The inequality in (5.108) entails assertion (a). 
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(b) Let / ^ 0 belong to L 1 (fl, T, p), and fix a > 0. We first prove that 
p {Sf > a} < p {/ > a}, i.e. the first inequality in (5.103). Let m be a positive 
integer. By the extra hypothesis in (b), together with assertion (ii) in Lemma 
5.61, we have 

min (to max (Sf — a, 0), 1) = S (min (to max (/ — a, 0), 1)). (5.109) 

From (5.109) we deduce 

min (to max (Sf — a, 0), 1) dp = f S (min (to max (/ — a, 0), 1)) dp 

< J min (to max (/ — a, 0), 1) dp. (5.110) 
In (5.110) we let m tend to co to obtain: 

/i {Sf > a} = lim min (to max (Sf — a, 0), 1) d/i 

m—>oo J 

< lim min (to max (/ — a, 0), 1) d/i = /i {/ > a }. (5.111) 

771—> 00 J 

This proves the first inequality in (5.103). In order to show the second inequality 
in (5.103) we proceed as follows. Let / be a member of L 1 (Q, T, /x), and define, 

always for a > 0 fixed, the subset D by D = j/ > aj. Here / is as in (5.106). 
In addition, define for n a positive integer, the subset D n by 

n 

D n = (J {S k f > a} n D. 


k= 0 


Then we have 


d 


{D n } < J d {S k f > a} < J] p {/ > a} = (ra + l)/x {/ > a} 




71+1 


1/17 


fc =0 


fc =0 


n=l 


and so p {T> n } is finite. We also have D id D n+ i 3 D n , and D = D n . Hence, 
because for /eL 1 (Q. SF, p) we have 

f-a^f- al Dn + (/ - al Dn ), 

it follows that 

{T> n } = ~ al Dn dp = al Dn dp ^ __ al Dn dp 

9{/>a} 9{/-a>0} J{(/-al Dn )>0} 

= J _ _ , (o1d„ ~7) dp + |_ / dp (5.112) 


afi 


(alr>„ - /) dp + _ /dp 

^{(/-alo n )>0} 


/{(/-al Dn )>0} 

(apply assertion (a) to the first terem in (5.112)) 

<0+f _ /dp < ll/ll,. 

J{(/-«W>o} 


(5.113) 
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In (5.113) we let n tend to go and infer ap |/ > aj < \\f\h- This is the second 
inequality in (5.103), and completes the proof of Theorem 5.65. □ 

5.66. Theorem (Theorem of Birkhoff). Let S : L 1 (Q,T,p) —* L 1 (12, S',//) be 
a linear operator such that for every f ^ 0, / e L 1 (12, T, p), the following two 
conditions are satisfied: 

(i) S (min (/, 1)) = min ( Sf , 1); 

(ii) \\Sf\h ■= SI-5/1 dp < If dp = \\f\\ v 

(It follows that all properties mentioned in Lemma 5.61 are available as well 
as the Propositions 5.63 and 5.64, an d Theorem 5.65.) Then for every f e 
L 1 (12, T, /i) the pointwise limit 

n —1 

hm - V S k f =: PJ (5.114) 

n—► oo n 

k =0 

exists p-almost everywhere. In addition, P^f belongs to L 1 (Ll,T, p), and the 
operator P4 is a projection operator, i.e. Pj) = P u , with the following properties: 

(a) J \P/xf\ dp < S I/I dp, and 

(b) SP^f = P^Sf = P^f, where f belongs to L 1 (12 ,T,p). 

If the measure p is a probability measure, then the limit in (5.114) is also an L 1 - 
limit, and P^f = [/ | j], / el 1 (12, T, p), where E^ [/ | j] denotes the con¬ 

ditional expectation on the a-field of invariant events: J = {A e T : Sl^ = 1^}. 


Before we prove this theorem we insert a corollary. 

5.67. Corollary. Let the notation and hypotheses be as in Theorem 5.66. 
Suppose that the operator S is ergodic in the sense that SI = 1, and Sf = f, 
f e L 1 (fil,“J,p), implies f = constant p-almost everywhere. If p(Vt) = go, then 
P,,f = 0, / e L 1 (12, T,p). If p (12) = 1, then P l J = \f dp, f e L 1 (12, T, p). 

PROOF. Let / el 1 (fid, T, p). Then SP^f = P^f, and so by ergodicity P^f 
is a constant //-almost everywhere. If p (12) = oo, then this constant must be 
zero, because P^f belongs to L 1 (12, T, p). If p (11) = 1, then, by the Id-version 
of Theorem 5.66, we have 

J P^f dp = jim ^ ^ J S k f dp = J f dp, (5.115) 

and the inequality in (5.115) completes the proof of Corollary 5.67. □ 


Proof of Theorem 5.66. Define for / e Id (12, T, p), and -oo < a <b < 
oo the subset C[ b by 


r'f 

^a.b 


( I n—1 -i n-1 2 

< liminf — V S k f < a < b < limsup — V* S k f > . 

I n to — n to J 


(5.116) 
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Then C^ b belongs to £F, and by Theorem 5.65 it follows that /i 
Proposition 5.64 we see that 


nf 

U a,b 


1 ryS f 1 cu/ • 


< oo. By 


(5.117) 


As in equality (5.102) in Theorem 5.65 we write g for the maximal function 
corresponding to g e L 1 (0, T, g). Then, with C = C( b , we see 

n— 1 n— 1 

((a - f ) lc) = sup - 2 S* {(« - /) lc} = sup - 2 {a - S^ -1 /} l c 

n Tl , n n , 


n— 1 


= a-inf-V S*/] l c , (5.118) 

V n n fc=o J 

and so from (5.118) and the definition of C = b we see that 

Cc {((a^Tflc) >0}. (5.119) 

The inclusion in (5.119) together with Theorem 5.65 yields 

f (a - /) lc dfj, = f (a — f) lc d/i = f __ (a - f)l c dfi> 0. (5.120) 

J JC J{((a-/)1 C )>0} 

As a consequence (5.120) implies f d/i < a/i. (C). A similar reasoning shows 
that C a |((/ — b ) lc) > oj, and therefore, like in (5.120), 

( (f - b) l c dfj, = f _ (/ - b ) lc d/v, ^ 0, 

J d{((/-fc)ic)>o} 

and hence b/j,(C) ^ $ c /d/n Since (5.120) entails $ c /d/r ^ ag,(C), we obtain 
bg, (C) < a/i (C). Since b > a and /./, ((7) we get /i = 0. The subset C'o 

dehned by 

t i n-i I n—i 1 

C'o = 1 liminf — V* A fc / < limsup — y A fc / > 
i ra_> °° n fc=o n ^°° n fc=0 j 


can be written in the form 


C n = 


u 


c’ at , 


—cc<a<b<cc, aA 


where the symbol Q denotes the set of rational numbers. So, by what is proved 
above the set C'o can be covered by a countable collection of subsets of the form 
C f ab , —co < a < b < co, all of which have /u-measure 0. Whence, ji (C'o) = 0. So 
the pointwise limit in (5.114) exists /.(-almost everywhere. 

(a) Next we prove assertion (a), and therefore P^f belongs to L 1 (Q, 'J. /./). By 
Fatou’s lemma we have 


/■» r* -i TL 1 /■» -« TL 1 

\P»f\ dn = lim - V S k f d/7, < liminf - Y | S k f\ 

j j n—»oo J7, J n—>cQ n ^ 


dfi 
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^ liminf 

n—»oo n 


-< Tl —1 r* -i Tl —1 r» r* 

- \S k f\ dfi liminf - V \ \f\ dfi = \f\ dfi. 

(5.121) 

The inequality in (5.121) shows property (a). 

The fact that P ® follows from (b). First let / ^ 0 belong to L 1 (12, T, //). 

Then 

n— 1 n— 1 

Pnf = liminf — V* = sup inf min — V* S' fc /, 

^ n ^> n /-J J AJ'>/V N<r,<N' n J ’ 


and hence 


/c=0 


-| nl ^ n —j. 

SR/ = liminf -Y S k f = sup inf min - V S fe+1 / 


TV N'^N N^n^N' n 


k =0 


n—1 


n 


k =0 
n—1 


TV N'^N N^n^N' n 


k =0 


liminf - V S fc+1 / = P,Sf = P„f, 

n ^>oo Ti 


k =0 


which implies property (b) for non-negative functions in L l (12, T. //). A general 
function can be written as a difference of non-negative functions in L 1 (12, T, //). 
This proves property (b). 
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Next we assume that // (Q) = 1. First we will show that the pointwise limit in 
(5.114) is in fact also an ZT-limit. For this purpose we fix / e L 1 (fi, T, /i) and 
a real number M > 0. Then we have 


I 






n— 1 


- y s k f - />„/ 

71 /c=0 

J (is/'4 (/‘d/K")) 

I (^S^-^j (JhfKM)) 


^+J ^S s ‘- p <‘)(Ai/i>«i) 


d/i 


dg 2 


I 


I/I dfx. 


(5.122) 




Since |g| < M, g e L 1 (f2,T, p) implies 151/1 < M and |P M g| < M, the integrand 
in the first term of the right-hand side of ( 5 . 122 ) is dominated by the constant 
I 1 -function 2 M. So from Lebesgue’s dominated convergence theorem, (5.114) 
and ( 5 . 122 ) it follows that 


^ 2 f |/| dg. (5.123) 

Since ill > 0 is arbitrary and / eh 1 (12, T, g) the inequality in (5.123) implies 

= 0, 

which is the same as saying that the limit in (5.114) also holds in Id-sense. The 
equality P^f = [/ | U], / e L 1 (12, T. g ), follows from the following two facts: 


f 1 n_1 

limsup -YS k f-P^f 

rwcc J 


d/i 


lim sup f 1 V S*/ - P^f 

n^co J n “ 


dg 


(a) the collection {del: 51,4 = 14 } is a cr-field, which is readily estab¬ 
lished. 

(b) Moreover, if / ^ 0 is such that Sf = f , and if a > 0, then 51{/ >c q = 
l{S/>a} = l{/>a}- 


This completes the proof of Theorem 5.66. 


□ 


4. Projective limits of probability distributions 

This section is dedicated to a proof of Kolmogorov’s extension theorem. We 
will also present Carathedory’s extension theorem. Let I be an arbitrary set 
of indices. Denote by “K (I) the class of all finite subsets of I, by 3T (I) the 
collection of all countable subsets of I, and by “K" (I) = 2 1 the class of all 
subsets of I. Consider a collection of measurable spaces (12;, Ai) indexed by 
i e I. For J e W (I) we write Clj = OjeJ - > an d f° r J c K c I we denote 
by p^T the canonical projection of 12^ onto f2j. If J = {/} c K we write p5 
instead of pF,, and if It = I we write pj instead of Pj. Hence p y denotes 
the (one-dimensional-)projection of 12/ on its j -th coordinate flj. Often these 
coordinate functions {pj : j e 1} serve as a canonical stochastic process. On 
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each flj, J e TC" (/), we consider the cr-field Aj = ®j e jAj, generated by the set 
of projections {pj : j e ./}, i.e. the smallest cr-field containing the sets 

{(p/)” 1 (-4) : j e J, A<= A/} = {{pj e A} : j e J, A e Aj} . 

The collection Aj is called the product a -algebra on Qj. One easily sees that, 
if J g BC (/), then Aj is generated by the set of rectangles, i.e. by 



If J a K a L a /, then we clearly have 

Pj = p{j°Pk- (5.124) 

It is easily seen that the projection pfi , where K, J e TC" (/), J a K, from fix 
onto ilj is measurable for the a -fields Ak and Aj. The latter is also written 
as: pj is AwAj-mcasurable. On 0/ we consider two classes of subsets: 

B = {{pj e A} = p^ 1 (A) : J g BC (/), A e Aj} , and (5.125) 

B 7 = {{pj e ^4} = Pj 1 (-'4) : J e 3C (-0 ? A g Aj} . (5.126) 

The subsets belonging to B are called cylinders or cylinder sets. If Z e B, 
respectively Z e B', then there exists J e J~C (I), respectively J' e BF (/), such 
that 

Z = A x n AJ . (5.127) 

The inclusions B c= B' c= Aj are obvious. 

5.68. Definition. Let B be a subset of the powerset of flj. Then B is called a 
Boolean algebra , if it is closed under finite union, and under taking complements. 

5.69. Lemma. The set B is a Boolean algebra, B' is a cr-field, and 

a {B} = B' = Aj. (5.128) 

PROOF. First we show that B is a Boolean algebra. Let Z = ( pj)~ l {A) = 

[pj g A), J g Bf (/), A g Aj, be a cylinder. Then 

:= n f \Z = n f \ (pj)- 1 (A) = {;pj g flj\Aj = pj 1 (.A c ), 

which shows that Z c belongs to B whenever Z e B. Furthermore, let Z, = 
pj. 1 (A), A g Bf(/), Ai g Aj a i = 1 be n cylinders. Then for J = 

J\ u • • • u J n we have 

Z\ U • • • u Z n = p-j] (A) U • • • u p~jl (A n ) 

= Pj 1 (piV 1 (A) u • • • u pj 1 (■ A n ) 

= Pj 1 ((PjJ” 1 (A) u • • • C P J Jn )~ 1 (A»)) • (5-129) 

Since the sets pj. 1 (A*) belong to ^4j for i = 1,..., n the set Zi u • • • is a 
cylinder. 

In the same manner one proves that S' is a cr-field. 
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In order to get the equalities in (5.128) it remains to show that Ai c <r{23}. 
Considering the definition of Ai it is sufficient to prove that pi, i e I, is mea¬ 
surable for <7 {23} and Aj. However, this follows from (5.125). So the proof of 
Lemma 5.69 is complete now. □ 

5.70. Remark. The fact that 23' = Aj is important. It shows that each B e Aj 
only depends on at most a countable number of indices, in the sense that B can 
be written as B = A x Qj\j where J is countable or finite and where A e Aj. 

The observation in this remark shows that the product cr-field is relatively 
“poor” when the index set I is uncountable. The following two examples will 
clarify this. 

5.71. Example. Take I uncountable, let each flj, i e I, be an arbitrary topo¬ 
logical Hausdorff space with at least two points, and let A,, be the Borel field 
of Q, . For every i e I we select e !!,. Since the singleton {{u>i) ieI } is a closed 
subset of fli with respect to the product topology, it belongs to the Borel cr-field 
of f2j. But it does not belong to Aj because it cannot be written as the set B 
in Remark 5.70. 
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5.72. Example. Take I = [0, oo) and suppose that Q, t = 12, where O is a 
topological Hausdorff space consisting of at least two points. Hence Q| 0: x) = 
fit 0 ’ 00 ) is the set of all mappings from [0, go) to 12. Let B be the subset of fit 0,00 ) 
consisting of all right-continuous (or all continuous) mappings from [0, go) to Cl. 
Assuming that B belongs to A[o i00 ) = A®[° ,0 °), where A is the Borel a-he Id of Cl 
will lead to a contradiction. Because, if B belongs to A[o (00 ), then by Remark 
5.70 B is of the form B = Ax 12[o !0 o)\j where J a [0, go) is countable, and where 
A e Aj. We may suppose that J contains all rational numbers. Pick / e B and 
t g [0,co)\J. We define the function g : [0, oo) —> Cl as follows: g(s) = f(s ) 
if s # and g(s) A f(t) if s = t. Then g e B, but it is not right-continuous, 
which can be seen as follows. In J there exists a sequence [t n ) n which decreases 
to t. Then 

lim g (t n ) = lim f (t n ) = f(t ) A g(t). 

n—> go n—* oo 

It follows that B $ A[o ;00 ). 


5.73. Definition. Consider a family of measurable spaces (12*, A*), i e I. Sup¬ 
pose that for every J e dC (/) Fj is a probability measure on ( Clj,Aj ) such 


that 


Pa [p* G A] = 


WY\A) 


Pj [A ], 


(5.130) 


whenever J, K e J~C (/), J a K , and A e Aj. Then the family {Pj : J g TC (/)}, 
or the family {(f2j, Aj, Pj) : J e J~C (/)}, is called a projective system of proba¬ 
bility measures, or spaces. Such a system is also called a consistent system, or 
a cylindrical measure. 


The following theorem says that a cylinder measure is a genuine measure pro¬ 
vided that the spaces Cli are topological Hausdorff spaces which are Polish, 
endowed with their Borel a-ficlds 23,. From Theorem 5.51 it follows that all 
probability measures g, on a Polish space S are inner and outer regular in the 
sense that 


ji{B) = sup {g(K) : K a B, K compact} = inf {g(0) : O 3 B, O open} 

(5.131) 

whenever B belongs to the Borel cr-field of S. The following theorem is a slight 
reformulation of Theorem 3.1. We also make the following observations. A 
second-countable locally-compact Hausdorff space is Polish. See Theorem 1.16, 
and see the formula in (1.18) which gives the metric. As mentioned earlier this 
construction can be found in Garrett [57]. 


A countable disjoint union of Polish spaces ( Ej , d 3 ) is Polish, with metric 


d(x,y) 


1 , (for x, y in distinct spaces in the union), 

d n (x, y ) (for x, y in the nth space in the union). 


(5.132) 


Here we assume that dj(x,y ) A 1. x, y e Ej. From this result it follows that a 
(j-compact metrizable Hausdorff space E = 'jf =x K :r Kj a Kj +i , Kj compact, 
is a Souslin space, i.e. a continuous image of a Polish space. This is so because 
every subset Kj is compact metrizable, and therefore separable. Therefore the 
complements Kj + i\Kj, j e N, are Polish, and since E is the disjoint union of 
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such spaces E itself is Polish. It is known that probability measures on the class 
of Borel subsets of Souslin spaces are regular. For details the reader is referred 
to Bogachev [21]. Before we formulate and prove the Kolmogorov’s extension 
theorem we will discuss the Caratheodory’s extension theorem. We need the 
notion of semi-ring , ring , and (Boolean) algebra of subsets of a given set 12. 

5.74. Definition (Definitions). Let 12 be a given set. A semi-ring is a subset 
S of fP(O) , the power set of 12, which has the following properties: 

(i) 0 e S; 

(ii) For all A, B e S, the intersection An B belongs to S (S is closed under 
pairwise intersections); 

(iii) For all A, B e S, there exist disjoint sets e S, with i = 1 , 2, ..., n, 
such that A\B = (J” =1 I\ t (relative complements can be written as 
finite disjoint unions). 

A ring Ik is a subset of the power set of 12 which has the following properties: 

(i) 0 g Ik; 

(ii) For all A, B e fk, the union A u B belongs to Ik (fk is closed under 
pairwise unions); 

(iii) For all A, B e Ik, the relative complement A\B belongs to Ik (Ik is 
closed under relative complements). 

Thus any ring on 12 is also a semi-ring. 

A Boolean algebra 23 is defined as a subset of the power set of 12 with the 
following properties: 

(i) 0 G 23; 

(ii) For all A e 23 and B e 23 the union A u £> belongs to 23; 

(iii) If A belongs to 23, then its complement A c = 12\A belongs to 23. 

Sometimes, the following constraint is added in the measure theory context: 12 
is the disjoint union of a countable family of sets in S. 

Without proof we mention some properties. Arbitrary (possibly uncountable) 
intersections of rings on 12 are still rings on 12. If A is a non-empty subset of 
1P(12), then we define the ring generated by A (noted Ik (A)) as the smallest 
ring containing A. It is straightforward to see that the ring generated by A is 
equivalent to the intersection of all rings containing A. 

For a semi-ring S, the set containing all finite disjoint union of sets of S is the 
ring generated by S: 


Ik (S) — j A : A — AjG§>. 


2 = 1 


This means that 31 (S) is simply the set containing all finite unions of sets in S. 
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A content // defined on a semi-ring S can be extended on the ring generated by 
S. Such an extension is unique. The extended content is necessarily given by: 

n n 

H(A) = for A = U Ai, with the A, e§ ! s mutually disjoint. 

i —1 i —1 

In addition, it can be proved that /i is a pre-measure if and only if the extended 
content is also a pre-measure, and that any pre-measure on 31 (S) that extends 
the pre-measure on S is necessarily of this form. 

Some motivation is at place here. In measure theory, one is usually not inter¬ 
ested in semi-rings and rings themselves, but rather in <r-algebras (or cr-fields) 
generated by them. The idea is that it is possible to build a pre-measure on a 
semi-ring S (for example Stieltjes measures), which can then be extended to a 
pre-measure on 31 (§), which can finally be extended to a genuine measure on a 
a-algcbra through Caratheeodory’s extension theorem. As a-algebras generated 
by semi-rings and rings are the same, the difference does not really matter (in 
the measure theory context at least). Actually, the Caratheodory’s extension 
theorem can be slightly generalized by replacing ring with semi-ring. 
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5.75. Definition. Let S be a semi-ring in CP (Q). A pre-measure on S is a map 
/i : S —> [0, oo] such that 

(i) n{0) = 0. 

(ii) If ( A n ) n is a mutually disjoint sequence in S, and if A := (J n A n belongs 

N 

to S, then /i(A) = // (A n ) = ^lim /i (A n ). 

n=l 

We also need the concept of outer or exterior measure. 

5.76. Definition. An outer measure on IP (f2) is a map A : IP (f2) —» [0, go] with 
the following properties: 

(i) A(0)-O. 

(ii) A c B implies A (A) ^ A (B). 

(hi) If (A n ) n is a sequence in ? (Cl), then A ((J n A n ) < Xln ^ CU- 

By taking all but finitely many A n to be the empty set one sees that an outer 
measure is sub-additive: A (A u B) ^ A(A) + A (B), A, B e 7 (O). Let A be an 
outer measure on IP (O). We define to be the set of all subsets A a Q such 
that for any DcSlwe have 

X(D) = X(An D) + X(A C n D). (5.133) 

Since an outer measure A is sub-additive we may replace the equality in (5.133) 
be an inequality of the form 

X(D) > X(AnD) + X(A c nD). (5.134) 

In other words, T,\ consists of all subsets icSl that split in two in a good 
way. Clearly, Q e and by the very form of the definition of Y,\, we have a 
subset A belongs to T,\ if and only if its complement A c belongs to £>. We now 
present the following proposition, whose proof is a bit tedious. For details the 
reader is referred to, e.g., [5] or [10]. The reader may also want to consult the 
Probability Tutorials by Noel Vaillant: http://www.probability.net/. The 
sets in are called Caratheodory measurable relative to the outer measure A. 

5.77. Proposition. Let A be an outer measure on Cl, and let £^ be as defined 

above. Then a cr-algebra on Cl. 

The Lebesgue-Stieltjes integral f(x) dg(x) is defined when / : [o, b] —*■ M. 
is Borel-measurable and bounded and g : [a, b] M is of bounded variation 
in [a, b] and right-continuous, or when / is Borel-measurable and non-negative 
and g is non-decreasing, and right-continuous. Define w((s,t]) := g(t) — g(s) 
and tv ({ a )) := 0 (Alternatively, the construction works for g left-continuous, 
te([s,f)) := g(t) — g(s) and w({b}) := 0). By Caratheodory’s extension theorem 
(Theorem 5.79), there is a unique Borel measure fi, g on [a,b] which agrees with 
w on every interval I c [a,b]. The measure fi g arises from the outer measure 

= inf j X> 9 W) : 
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where the infimum is taken over all coverings of E by countably many semi-open 
intervals I,. This measure is sometimes called the Lebesgue-Stieltjes, or Stielt- 
jes measure associated with g. The Lebesgue-Stieltjes integral f(x) dg(x) is 
defined as the Lebesgue integral of / with respect to the measure g g in the usual 
way. If g is non-increasing, then define § f(x)dg(x ) := — §f(x)d(—g)(x). If 
the function g : [a, b] —> M is right-continuous, and of bounded variation on 
[a, b], then g may be written in the form g = g\ — g 2 , where the functions g\ 
and #2 are monotone non-decreasing and right-continuous. So that g g (s, t] = 
g(t) — g(s), a E s < tE b, extends to a real-valued measure on the Borel field 
of [a, b]. Of course, if g were right-continuous, complex-valued and of bounded 
variation, then g can be split as follows g = Re g + ilm g = gi — g 2 + i (<73 — < 74 ) 
where the functions g g , 1 ^ j ^ 4, are right-continuous, and non-decreasing. 
To the function g we can associate a complex-valued measure g g such that 
H g (s,t] = g(t) — g(s), a < s < t ^ b. For more details on Riemann-Stieltjes 
integrals the reader is referred to [ 130 ]. The book by Tao [ 137 ] contains a 
discussion on Stieltjes measures. 

The definition of semi-ring may seem a bit convoluted, but the following simple 
example shows why it is useful. 

5.78. Example. Think about the subset of T (M) defined by the set of all half¬ 
open intervals (a, b] for a and b reals. This is a semi-ring, but not a ring. Stieltjes 
measures are defined on intervals; the countable additivity on the semi-ring is 
not too difficult to prove because we only consider countable unions of intervals 
which are intervals themselves. Proving it for arbitrary countably union of 
intervals is proved using Caratheodory’s extension theorem. 

Now we are ready to formulate the Caratheodory’s extension theorem. 

5.79. Theorem (Caratheodory’s extension theorem). Let Li be a ring on fl 
and /i : Li —> [0, 00 ] be a pre-measure on a Li. Then there exists a measure 
/// : a (fk) —> [0, go] such that g! is an extension of g. (That is, g! | = g). Here 
g (IR) is the a-algebra generated by Li. 

If g is a-finite then the extension g' is unique (and also a-finite). 

If Ik is a Boolean algebra, then Theorem 5.79 is also called the Hahn-Kolmogorov 
extension theorem. A complete proof can also be found in [ 21 ] Theorem 1.5.6. 
We will present just an outline. Another interesting book is Tao [ 137 ]; in partic¬ 
ular see Theorems 1.7.3 (Caratheodory’s extension theorem) and 1.7.8 together 
with Exercise 1.7.7 (Hahn-Kolmogorov’s extension theorem). An (older) paper, 
which treats Caratheodory’s extension theorem thoroughly, is Maharam [ 90 ]. 

Proof. The proof is based on the cr-field corresponding to the outer (or 
exterior) measure associated to pre-measure g. This exterior measure g* is 
defined by 

{ 00 00 ^ 

2 M (A k ) : A k E % A c= U A k > , A c= Vt. (5.135) 

k= 1 k= 1 J 
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(If A can not be covered by a countable union of sets in 1R, then we put /j* (-4) = 
oo.) Then it is not too difficult to prove that p* is an outer measure. Like in 
Proposition 5.77 let E^* be the a -field consisting of those subsets A of D for 
which 

p* (D) ^/u*(AnD) + /j* (A c n D ) 

for all D a fl. Then it follows that the cr-field E ;i * contains the ring T. Put 
/// (B) = //* ( B ), B e E^*. Then y! is a measure on E^* which extends /i, and 
which unique provided that y is cr-finite. For details see [21] Theorem 1.5.6. 
This concludes an outline of the proof of Theorem 5.79. □ 

5.80. Example. Let E be a cx-compact topological Hausdorff space, and assume 
that each compact subset Kj is metrizable, and hence separable. Define the 
sequence of open subsets (Oj) . of E as follows: Oo = 0, 0\ = E\K\, Oj+i = 
(E\Ki) n • • • n ( E\Kj ), j ^ 1. Then, for an appropriate metric (x, y ) > dj{x , y), 

x, y e Kj n Oj, 0 < dj(x , y) A 1, the spaces K :) n 0 3 is complete metrizable and 
separable, and so a Polish space. Moreover, by construction the spaces Kj n Oj , 
j = 0, 1, ..., are mutually disjoint, and so the E can be supplied with the metric 
d(x,y ) defined by d(x,y ) = 1 , if x, y belong to different spaces Kj n Oj, and 
d{x,y ) = dj(x,y), if x and y belong to Kj n Oj, j = 0, 1, .... Then this metric 
turns E written as a disjoint union of Kj n Oj into a Polish space. Its topology 
is stronger than the original one, and hence E itself is continuous image of a 
Polish space (via the identity map). It follows that E is a Souslin space. 




MAERSK 


I joined MITAS because 
I wanted real responsibility 


The Graduate Programme 
for Engineers and Geoscientists 

www.discovermitas.com 


Real work 
International opportunities 
Three work placements 




a 


I was a construction 
supervisor in 
the North Sea 
advising and 
helping foremen 
solve problems 


365 



Download free eBooks at bookboon.com 







Advanced stochastic processes: Part II 


Some related results 


Example 5.80 should be compared with the notion of disjoint unions of Polish 
spaces are again Polish: see (5.132). 

5.81. Theorem (Kolmogorov’s extension theorem). Let 

{(fi^B^Pj) : Je IK (I)} 

be a projective system of probability spaces. Suppose that for every i e I, is 
a Polish space (or Souslin space) endowed with its Borel a-field Bj. Then there 
exists a unique probability measure P 7 on (f2 7 , B 7 ) such that 

Pi[p J eA] = P I [pj\A)]=Pj[A], A e Bj, (5.136) 

for every J e BC (/). 


PROOF. If Z = {pj e ^4} = p 3 1 (^4), J e 0i (/), A e Bj, is a cylinder in fl 7 , 
then we define P 7 [Z] by 

Pj [Z] = Pj [A ]. (5.137) 

This definition is unambiguous. Indeed, let 
Z = {pj e A} = pj 1 (A) = pf) (. B) = {p K 6 B} , with AeBj and B e B K , 


with J, K e “K (I). We have to show that P./ [B] = P K [B], Indeed, with 
L = J u K, we get 


Z = Pf 1 (pj ) _1 ( A ) = {pj o p L e A} 

= {Pj e A} = {p L j)~ l ( A) = (jp L K y 1 (B) = {; Pk e B} 

= {Pk°Pl eB}= PZ 1 K )” 1 (#) • 

From (5.130) together with (5.138) we infer 


rj[A]=V L (pj) 1 (^ 4 ) 


p, 


(p L kY\b) = P*[B]. 


The equality in (5.139) shows that P 7 is well defined. We also have 


(5.138) 

(5.139) 


Pj [«/] = Pj [pf 1 (fii)] = = 1. 


Next we show that P 7 is finitely additive on B, the collection of cylinders. Let 
Z = pj 1 (A), with J e J~C (/) and A e B j, and Z' = pf) (B), with K e BC (/) 
and B e B k be two disjoint cylinders. Put L = J u K. Then we have 

0 = z n Z’ = pf 1 (p L j)~ l (. A ) n (pi )” 1 ( 5 ) 

= Pl 1 ((Pj )” 1 O) n (: Pk ) _1 (^)) • (5.140) 

From (5.140) we infer (pj) 1 (A) n WF 1 (B) = 0. Consequently, we obtain 


P 7 [Z u B'] = P 7 [pf 1 (p 1 ))- 1 (A) u pf 1 (p L K )- L (B) 
= Pj P~l ({Pj) 1 ( a ) u ( Pk )~ 1 (#)) 
= [(Pj )” 1 (^) u CPr)~ 1 ( 5 ) 


.-i \-i 


= Pr 


W) _1 (^) 


+ Pl 


( 4 ) _ 1 ( 5 ) 


366 


Download free eBooks at bookboon.com 
















Advanced stochastic processes: Part II 


Some related results 


(apply the equality in (5.130)) 


= P 7 [A] + P* [B] = P, [A] + P/ [B ]. (5.141) 

The equality in (5.141) proves the finite additivity of the mapping Pj on the 
collection of cylinder sets B. 


Finally we prove that the mapping P/ is cr-additive on B. For that purpose we 
consider a decreasing sequence (Z n ) n of cylinder sets such that Pj [Z n \ Y a > 0 
for all n e N. We will show that Z n 0. By contraposition it then 
follows that f~) n Z n = 0 implies lim„^, x , Pj [Z n ] = 0. For each n we have 
Z n = pj 1 (An) with J n e BC (/) and A n e Bj n . Of course we may suppose that 
Ji c J 2 c= • • • c= J n a ■ ■ ■ . Put J = (J n J n . Then 

Z n = Pj 1 (PjJ” 1 (4») = (PjJ” 1 ( A n) X 


Since 



(Pjn) ( An ) 


X &I\J 


(5.142) 


we see that f) n Z n =£ 0 if and only if f~) rt (pj n ) 1 ( A n) A 0. This means that 
our problem is reduced to the problem with / = J, i.e. to a countable problem. 
For every meN there exists a compact subset L ]m of il ]rn with 


Then L := Om Aim i s a compact subset of Qj. Furthermore, for every n e N 
we have 


S 2 PAij] < j- (5.143) 

jtJn 

On the other hand for every n e N we choose a compact subset K n of A n (in 
Bj n ) such that 

P* [A.V4] ' 0^- (5.144) 

For every neN the set Y n defined by 

4 - (P *)" 1 (Ki) n • • • n (?£_,)'' <A'„-i) n K n 

is a closed subset Qj n , and so Z' n := pj! (Y n ) is a closed cylinder in Qj, and 
Z' n cr Z n . In addition, we have 

Z'n = Pj! (Ki) n • • • n p-jl, (Kn- 1 ) n pj! (K n ) 
the sequence (Z' n ) n is decreasing. We also have 


P.j 


IW 


Kj^Jn 


= Pj 


U (pi 


*) _1 (q) 


_jE.J n 


V, [Zn\Z' n ] - V, 


z »\( n zp K t) 




< 


£ E, [Z„VJ,‘ (44)] « £ E, [Z ^ 1 (A' t )] 


k =1 


fc=l 
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- 2 E, [p-jl (A k ) >7/ (if*)] - 2 E, K (AV4)] 

k —1 k —1 

n 

- 2 E* [-4AAJ < f • (5.145) 

fc=l 

Since, by assumption, Pj [Z„] ^ a, (5.145) implies 

Pj. K] = P, [Z(J = P, [Z„] - P, [Z n \Z' n ] > a - J (5.146) 


Since, by (5.146) and (5.143) we have 


Pi 


j^Jn 


\jeJn 


> 1 - Pi Kl - Pi, 

it follows that T„ nrw Lj ^ 0. Moreover, observe that 


3a a a 

^T-4-2- (5147) 


z;,ni = (y;xp|) = ( y »flhiM x El ^ < 5 - 148 ) 

\ m / \ j^Jn / jEj\J n 


and consequently, Z' n n L / 0 . Hence, the decreasing sequence ( Z' n n L) n 
consists of non-empty compact subsets of flj. By compactness we get that 
f\ Z'n n L 7 ^ 0. So we infer 

n^(A^n^n<^ n L) # 0. (5.149) 

n n n 

As a consequence of the previous arguments, we see that Pj is a cr-additive 
on the Boolean algebra B which consists of cylinders in !!/-■ This measure P j 
satisfies (5.136). By the classical Caratheodory theorem the mapping Pj extends 
in a unique fashion as a probability measure on the cr-field a {13} = B/. Then, 
technically speaking, the mapping P/, defined on the Boolean algebra 13, is a 
pre-measure. This corresponding exterior measure P* is defined by 


P} (A) = inf 


00 

2 

> = 1 


/i (Zfc) : Zfc e B, A <z Zfc 


fe=i 


(5.150) 


Then it is not so difficult to prove that the set function defined by (5.150) is an 
outer measure indeed. Define the associated cr-field T> by 


Bi = {Adl j: P *{D) ^ P} (A n D) + P* (A c n D) : for all D c D/} . 

(5.151) 

The fact that T> is a cr-field indeed follows from Proposition 5.77: see Theorem 
5.79 as well. It is fairly easy to see that B> contains the Boolean algebra B which 
consists of the cylinder sets in 0/-. 

This completes the proof of Theorem 5.81. □ 
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5. Uniform integrability 

The next Theorem is often used as a replacement for the dominated convergence 
theorem of Lebesgue. 

5.82. Theorem (Theorem of Scheffe). Let (12, T, p) be an arbitrary measure 
space and let (f n : n e N) be a sequence of non-negative functions in L 1 ^, T, p). 
In addition, let the function f belong to L 1 (12, T, p). Suppose that f(x) = 
lim n ^oo f n (x) for p-almost all x e Cl. The following assertions are equivalent: 

(i) lim n ^oo J | f n - f\dp = 0; 

(ii) The sequence (f n :neN) is uniformly integrable; 

(hi) hm n ^oo J f n dp = $ fdp. 

Instead of uniformly integrable the term equi-integrable is often used. A family 
( f a : a e A) in L l (12, T, //) is uniformly integrable, if for every e > 0 there exists 
a function g ^ 0 in L 1 (12, T, p) such that >g , \f a \ dp < e for all a e A. 

5.83. Proposition. If p is a probability measure, then a family (f a \ ae A) in 
L 1 (12, T, p) is uniformly integrable, if and only if for every e > 0 there exists a 
constant M e ^ 0 such that | >M ^ \f a \ dp ^ e for all a e A. 
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Proof. The sufficiency is clear: choose for g E a constant function M e . Next 
we show that, if the family (f a : a e A) is uniformly integrable, then necessarily 
for every e > 0 there exists a constant M e such that 


1 


\fa\ dfi £, a e A. (5.152) 

>{\U\>M e } 

Fix £ > 0. By hypothesis we know that there exists a function g £ e L 1 (fl, T, //), 
g e > 0, such that 


1 


\f a \ d/d < aeA. 


’{\fa\>9e} 

Then we choose M e so large that 


J 


J{g £ ^M £ 

Then by (5.153) and (5.154) we have 


Qed^l < 


(5.153) 


(5.154) 


\fa\ dg, = 

A\f a \>M e } J{ 

<1 


{M £ ^\f a \<nmx(M £ ,g £ )} 

g e d/d + 


| fa | djl + 

J{ 


\fa\ d/d 


{g £ >M £ 


I, 


{ | fcx | 


{|/ a |^max(M e , 0 e )} 

I fa | d/d | + | = e. (5.155) 


The inequality in (5.155 completes the proof of Proposition 5.83. 


□ 


Proof of Theorem 5.82. (i) => (ii). Put g = sup neN f n - The following 
inequalities hold for m e N: 


I 






< 


( \f n —f\dfi+ r 
J I fn - f\dg + 


I 

f | fn ~ f\dfd+ f 

J J( 


f d/d 


fdg 




fd[d. 


(5.156) 




Let e > 0, but arbitrary. By (i) there exists iV(e) e N such that $ |/ n — f\ d/d < 
e/2 for n > iV(e) + 1. The inequalities below then follow for m > M(e): 


fd/d T e/2, and 
J{g>mf} J{ 




fndjd < e, 1 < n < N(e). (5.157) 


From (5.156) and (5.157) we see >M(e)/} .C 0 / 7 ^ e - But this means that the 
sequence (/ n : n e N) is uniformly integrable. 

(ii) =^> (iii). Let e > 0 be arbitrary and choose a function g e e L 1 (f2, T, /i) such 
that 


f n dfd + 

d{fn^ge} A 


fdn ^ e. 


(5.158) 


{fn^9e 


From (5.158) we obtain 


J fndg - J fdjd < J 


\fn f\ dfd T 




fndfd T 


fd/d 


{fringe 
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^ I fn~ f I dfi + e. (5.159) 

d {fringe} 

By the theorem of dominated convergence, it follows from (5.159) that 

lim sup 

n —>oo 

Since e is arbitrary assertion (iii) follows. The same argumentation shows the 
implication (ii) => (i). 

(iii) => (i). The equality 

\fn ~ f\ = fn ~ f + 2 (/ - min (/, /„)) 

is obvious. From (iii) together with the theorem of dominated convergence it 
then follows that 

lim f | f n - f\ dp. 

n—»°o J 

= lim [ (f n - f)dju + 2 f lim (/ - min (/, /„)) d[d = 0. 

n —>°° J J n —>oo 

The proof of Theorem 5.82 is now complete. □ 

5.84. Corollary. Let (/ d m : m e N) be a sequence of probability measures on 
the Borel a-field of W. Let every measure /d m have a probability density g m 
relative to the Lebesgue measure A. Furthermore, let g ^ 0 be a probability 
density. Suppose that for X-almost all ieK 1 ' the equality lim m ^.oo g m (x) = g(x) 
is true. Let the measure fd have density g. Then the sequence (/x m :meN) 
converges weakly to p. 

Proof. From the theorem of Scheffe (Theorem 5.82) we see 

lim | g m (x) -g(x)\dx = 0. 

ra—KX) J 

Let / be a bounded continuous function. Then 

< ll/lloo J \9m(x) -g(x)\dx. 

(5.160) 

The assertion in Corollary 5.84 follows from (5.160). □ 

5.85. Theorem. Let (X m : me N) be a sequence of stochastic variables, which 
are defined a probability space (f2,T, P). 

(a) If the sequence (X m :meN) converges in probability to a stochastic 
variable X, then the sequence of probability measures (Px m : m e N) 
converges weakly to the distribution Px; 

(b) If the sequence (Px m '.me N) converges vaguely to the Dirac-measure 
S a , then the sequence (X m : m e N) converges in probability to a sto¬ 
chastic variable X, which is P -almost surely equal to the constant a. 


J fdii m - J fdp = f 


(f(x)9m(x) ~ f(x)g(x)) dx 


J f n d/d - J 


fd/d 


< e. 
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Proof, (a) Suppose that the sequence (X m :meN) converges in proba¬ 
bility to X. We pick / 6 G' 0 o (R") and we will prove that liin m ^ x $ fdFx m = 
J fdFx ■ The latter is equivalent to lim m ^oo $ f(X m ) dP = §f(X) dP. The func¬ 
tion / is uniformly continuous. So, for e > 0 given, there exists 8 > 0 such 
that 

\x 2 — X\ | < 8 impliceert \f(x 2 ) — f(x i)| < e. (5.161) 

Put A m = {|X — X, m | ^8}. For u $ A m the inequality 

holds. From this it follows that 

<f \f(X)-F(X m )\dP+ f \f(X) — f (X m ) \ dP 

JAm 

<£P(^) + 2||/|| 00 P{|X m -U><S} 

<€ + 2||/|| a) P{|X m -X|>i}. (5.162) 

The assertion in (a) follows from (5.162) together with assertion (3) in Theorem 
5.43. 


f fd Px„. - f fdPx 
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(b) Suppose that the sequence (Px m : m e N) vaguely converges to the Dirac- 
measure 8 a . Let /(e) be the interval /(e) = [a — e, a + e] and choose functions 
/ and g e Coo (IB/') such that / < lj < g and such that /(a) = g(a) = 1. Then 
the equalities follow: 

/(a) = liminf f fdF Xm < hrn inf F Xm (/) limsupP Xm (I) 

m J m m 

^ limsup I gdF Xm = g(a). (5.163) 

m J 

From (5.163) it follows that 

limP (\X m — a| < e) = 1, 

m 

which amounts to the same as 

limP (\X m — a\ > e) = 0. 

m 

This proves assertion (b). So the proof of Theorem 5.85 is now complete. □ 

6. Stochastic processes 

We begin with some definitions. 

5.86. Definition. Let (fl,T, P) be a probability space, and let (E,fL) be a 
locally compact Hausdorff space, that satisfies the second countability axiom, 
with Borel cr-field £. Often E will be chosen as M or as I". A stochastic 
process X with values in the state space E is a mapping X : [0, oo) x Q, —*• E. 
For every u) e fl the mapping t i—> X(t,u>) defines a path of the process. A 
path is sometimes also called a realization. If we fix n e N, then the mappings 
Pf lv .. jtn : £ 0 • — 0 £ —» [0,1], where (ti,... ,t n ) varies over [0,co) n , and which 

nx 

are defined by 

P = P{(X(C),.. .,X(t n )) eB), Be£®-.-®£, (5.164) 

v -v- y 

nx 

are called the n-dimensional distributions of the process X. Here X(t) is the 
mapping X(t)(oj) = X(t, u), u> e O. 

Sometimes we write X t instead of X(t). If n = 1, then the distributions in 
(5.164) are also called the marginal distributions, or marginals. However, no¬ 
tice that a process is much more than the corresponding collection of finite¬ 
dimensional distributions. In particular the paths or realizations of a process 
are very important. For example, the continuity properties of the paths are 
relevant. Often we will suppose that the paths are continuous, or that they are 
continuous from the right, and possess limits from the left {cadlag paths}, or 
cadlag paths. So that the process X is cadlag provided that for all t ^ 0 the 
equality lim^ X(s) = X(t) holds P-almost surely (this is continuity from the 
right, or continue a droite in French) and if the limit lim s | t X(s) exists in E 
(this means that the left limits exist in E, limite a gauche in French). 
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5.87. Definition. A family sub-cx-fields (T t : t ^ 0) of T is called a filtration (or, 

sometimes, also called history ), if t < s implies T t c= 37,.. Thus the probability 
P is defined on all o -fields T t . With Too, or also Too- the cr-field generated 
by Ut*> T t is meant. If for every t Y 0 the equality ‘J t = n s>t T s holds, 

then the filtration (Tt : t y 0) is called continuous from the right, or right- 

continuous. Let (Tt : t ^ 0) be a filtration, and put “J t+ = f] s>t T s . Then the 
family (T t+ : t ^ 0) is a right-continuous filtration. This filtration is called the 
right closure of the filtration (T t : t y 0). A subset A of ft is called a P -null 
set if there exists a subset A 0 e T with the following properties: A cr A 0 and 
P[A 0 ] = 0. Usually this is expressed by saying that A is a null set instead 
of A is a P-null set. Often it is assumed that To contains all null sets, and 
that the filtration (T t : t ^ 0) is right-continuous. Sometimes it i said that To 
has the usual properties. The process A is called adapted to the filtration 
(T t : t ^ 0) if for every t y 0 the state variable X (t) is measurable with respect 
to u-fields Tt and £. Let J~C t = o (A(u) : 0 < u < t) be the a-field generated 
by the state variables X(u), 0 < u < t. The filtration (TC t : t 5= 0) is called 

the internal history of the process X. If t > 0 is given, then “K t is called 

the (information from the) past, o(X(t)) is called the (information from the) 
present, and o (A(«) : u ^ t) the (information from the) future. The process X 
is adapted if and only if “K t Q T t for every t Y 0. 

5.88. Definition. Let X and Y be two processes. The processes X and Y are 
said to be non-P-distinguishable or P-indistinguishable provided there exists a 
P-null subset N with the property that for every oj f N and for every t y 0 
the equality X(t, u>) = Y(t,u> ) holds. The process X is called a modification of 
the process Y (or also Y is a modification of X) if for every t ^ 0 there exists 
a P-null set N t with the property that X(t,cu) = Y(t,u>) for u> <£ N t . Thus the 
null set is f-dependent. If the processes X and Y are not distinguishable, then 
A is a modification of Y. In general, the converse statement is not true. 

5.89. Theorem. Suppose that the process X as well as the process Y possesses 
right-continuous paths. If X is a modification of Y, then X and Y are not 
distinguishable (also called stochastically equivalent). 

Proof. Let A be a modification of the process Y . For every t ^ 0 there 
then exists a null set N t such that X(t) = Y(t) on the complement of N t . Put 
N = {J te qN t . Then P(A) = 0 and for every t e Q the equality A (t) = Y(t) 
holds on the complement of N. By right-continuity of the paths it then follows 
that 

X(t)= i™ lim rw-y(t) 

on the complement of N and completes the proof of Theorem 5.89. □ 

5.90. Definition. Let (T* : t ^ 0) be a filtration and let T : Si —> [0, go] be a 
“stochastic time”. The function T is called a stopping time for the filtration 
(T t \ t 'y A) if for every fixed time t the event {T A t} belongs to “J t . Since 
the event {T < go} = (J n6H {T < n } belongs to T x , the complementary event 
{T = go} is also an element of 
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5.91. Theorem. Let : t ^ 0) be a filtration. Let (T t+ : t ^ 0) be the so-called 
right closure of the filtration (T t : t ^ 0). Then a stochastic time T : Cl —► [0, go] 
is a stopping time for the filtration (T t+ : t ^ 0) if and only if, for every t > 0, 
the event {T < t} belongs to T t . 


Proof. u Sufficiency” Suppose that for every t ^ 0 the event {T < t\ be¬ 
longs to Then the event {T ^ t} = j T < 1 + — 1 belongs to the cr-held 

neN f n J 

flneN ^t+n -1 = 


“ Necessity” Assume that for every t ^ 0 the event {T ^ t} belongs to T t+ . 
Then the event {T < t} = U T 1 - - } belongs to \J nefq ff t _ n -i + c= ff t . 

neN f n ) 

This completes the proof of Theorem 5.91. □ 

5.92. Corollary. Let (9y : t ^ 0) be a right-continuous filtration. Then the 
stochastic time T is a (ff t : t ^ 0) -stopping time if and only if for every t ^ 0 
the event {T < t} belongs to and this is the case for every t > 0 if and only 
if for every t > 0 the event {T ^ t} belongs to T t . 
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5.93. Theorem. Let (T t : t ^ 0) be a right-continuous filtration, let X be an 
adapted cadlag process, let G be an open subset and let F be a closed subset of 
E. In addition, let (G n :neN) be a sequence of open subsets of E such that 
F = Pi n G n and such that G n 3 G n+ \, n e N. Finally, let ( F n :neN) be an 
increasing sequence of closed subsets with the property that G = (J n F n . Define 
the times S, S n , T and T n by means of the equalities: 

S = inf {s > 0 : X(s) e F or X{s~) e F} ; 

S n = inf {s ^ 0 : X(s) 6 F n or X(s-) e F n } ; 

T n = inf {s ^ 0 : X(s) e G n } and T = inf {s ^ 0 : X(s) e G} . (5.165) 

Then these times are stopping times and the following assertions hold: S n [ T 

and T n j S. 

Proof. Let t > 0. Since the paths are continuous from the right se see 
{T < t) = 1J {A'(r) e G} = [J {X(r) s G) s 

0 <r<t 0 <r<t,rQ 

This proves that T is a stopping time. Since 

{S < t} = {X(t) ef or X{t~) e F} u n u £f > 

\neN r<t,reQ J 

it follows that S' is a stopping time as well. Since G n 3 G n+ 1 it follows that 
T n+ 1 > T n . Put So = supT n . The ultimate equalities in 

00 00 

{So < *}=u n u s 

m—1 n—1 

oo 

- U U W*)EfofVHeFl 

m—1 m _1 

= U {X(s) e F of X(s-) e F} = {S<t} 

0^s<t 

prove the equalities {So < t) = {S < t} for all f > 0 and hence, S = So- The 
fact that S n J, T is left to the reader as an exercise. This completes the proof 
of Theorem 5.93. □ 

5.94. Theorem. Let S and T be stopping times for the filtration (Sq : t ^ 0). 
Then min(S, T), max(S, T ) and S + T are also stopping times for this filtration. 
If (S n : n e N) is a sequence of stopping times, then sup n S n is also a stopping 
time, and if, moreover, the filtration (Tt : t ^ 0) is right continuous, then inf n S n 
is stopping time as well. 

PROOF. The proof is left as an exercise for the reader. □ 

5.95. Definition. Let T be a stopping time for the filtration (9q : t ^ 0). The 
u-fie Id of events which precedes T is defined by 

J T := Pi pi e Too : A n {T < t} e %}. 

t^O 
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Indeed, the collection 3t is a a-field and if T = t is a fixed time, then 3t = 3 t . 
If S < T is also a stopping time, then 3s c dr- If the filtration (3 t : t ^ 0) 
is continuous from the right, then an event A belongs to 3j- if and only if A 
belongs to T x , and if for every t > 0 the event A n \T < t} belongs to 3 t . If S 
and T are stopping times, then 9 r m i n (g,T) = 3s n 3t- If the filtration (3 t : t ^ 0) 
is right continuous and if (S n : n eN) is a sequence of stopping times which 
converges downward to S, then S' is a stopping time and f\eN ?s n = 7 S . 

5.96. Definition. A process X : [0, oo) x Cl —> E is called progressively mea¬ 
surable for the filtration (3 t : t ^ 0) if for every t > 0 the restriction of X to 
[0, f] x Cl is measurable for the a-fields 23 [0, t] (x) 3 t and £. 

5.97. Theorem. If X is right-continuous adapted process, then X is progres¬ 
sively measurable. 


Proof. Define the sequence of processes (A" : n e Id) by means of the for¬ 
mula: 

„ , , (X(*£t,u), if k2~ n t < u < (k + l)2~ n t, 0 < k < 2 n - 1; 

V ' [0, if« = 0. 

(5.166) 

Let B e £. Then we have 


{X n e B } 

= {0} x {A'(0) e B] u U 
E*[0 


0^fc<2 n -l 


k k + 1 


2 n ’ 2 r ‘ 


X 


k + 1 


B 


(5.167) 


So X n is progressively measurable. Because the process X is P-almost surely 
right-continuous it follows that lim„^ x X" = X, and, consequently, X is pro¬ 
gressively measurable. This completes the proof of Theorem 5.97. □ 


5.98. Theorem. Suppose that X is progressively measurable for the filtration 
(3 t : t ^ 0). Let T be a stopping time. The the state variable X(T) : u> i—> 
X(T (uj), uj) measurable for the a-fields £ and 3t- 


Proof. On the event {T < t} the mapping u> i—> X (T(lu),ui) is the com¬ 
position of the mapping u >—> (T(u),uj), which goes from {T < t) to [0, t] x 
and which is measurable for the a-fields 3 t and B[0. t] ® 3 f , and the mapping 
(u, u>) I—► X(u,u), which goes from [0. t] x O to E and which is measurable for 
the a-fields 23[0,t] ® 3 t and £. (In the latter argument the progressive mea¬ 
surability of X was used.) The composition of measurable mappings is again 
measurable, and hence X(T) is measurable for de a-fields 3t and £. 

This completes the proof of Theorem 5.98. □ 

5.99. Corollary. IfT is a stopping time and if X is progressively measurable, 
then the process X T defined by X T (u) = X(rnin(T, u)) is adapted to the stopped 
filtration (3 min{ r,u) : u ^ 0). 
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Proof. The proof is left as an exercise for the reader. □ 

The next lemma is often employed instead of the monotone class theorem. 

5.100. Lemma. Let T be a a-field on Cl and H a vector space consisting of 
£T -measurable real-valued bounded functions on Cl. Suppose that the following 
hypotheses are fulfilled: 

(1) H contains the constant functions; 

(2) If f and g belong to H, then the product fg belongs to H; 

(3) If f is the pointwise limit of a sequence of functions (f n : n e N) in H, 
for which \f n \ < 1, then f belongs to H; 

(4) T = a (f : f e H). 

Then H contains all bounded T -measurable functions. 

Proof. Let T be the collection D = {A e jl: L e H}. Then T is a Dynkin 
system and by (2) T is closed for taking finite intersections. So 2) is a cx-field. 
Pick / e H and let a e M. We will prove that the set {/ ^ a } belongs to T. 
By taking an appropriate combination of / and the constant function 1 we may 
assume that 0 ^ ^ 1 and that 0 < a ^ 1. Let p be a polynomial. By (2) p(f) 

belongs to H. Let ip : [0,1] -» M be a continuous function. By the theorem 
of Stone-Weierstrass there exists a sequence of polynomials (p n : n e N) such 
that sup^gj-g^ \p(x) — p n (%)\ n -1 . Consequently, <p(f) belongs to H. Since 

the function l[ a;00 ) is a (decreasing) pointwise limit of a sequence of continuous 
functions, it follows that l[ a ,oo)(/) = 1 {/> a } belongs to H. So the set {/ ^ a} 
belongs to T>. From which it follows that T> = T. But then we infer T cz 
{A e T : Ia £ H}. From this the assertion in Lemma 5.100 immediately follows. 

□ 


5.101. Definition. Let (T t : t ^ 0) be a filtration on the probability space 

(f2,T,P), 


and let X be an adapted process. 

(i) The process X is called a martingale (relative to P and to the filtration 
(fJt : t ^ 0)) if for every t ^ 0 the variable X(t) belongs to L 1 (Cl, T, P) 
and if for every pair 0 < s < t the equality W(s) = E (X ( t ) | T s .) holds 
P- almost surely. 

(ii) The process X is called a sub-martingale (relative to P and to the 
filtration (T) : t ^ 0)) if for every t ^ 0 the variable X{t) belongs to 
L 1 (Cl, T, P) and if for every s < t the inequality X(s) < E (X(t) | T s ) 
holds P- almost surely. 

(iii) The process X is called a super-martingale (relative to P and to the 
filtration (T) : t ^ 0)) if for every t ^ 0 the variable X(t) belongs to 
L 1 (Cl, T, P) and if for every s < t the inequality X(s) ^ E (X(t) | T s ) 
holds P- almost surely. 
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Instead of assuming X(t) e L 1 (f2, T, P) in (ii) it is sometimes assumed that the 
variable X(t) + = max(X(/),0) belongs to L 1 (f2, SF, P). In (iii) it is sometimes 
only assumed that X(t)~ = max(— X(t),0) belongs to L 1 (fl, T, P). If T is a 
(discrete) subset of [0, oo) and if (X(t), %) t>0 is a martingale (sub-martingale, 
super-martingale), then the process (X(t), %) teT is so as well. Then we can use 
“discrete results” and via a limiting procedure we then obtain results in the 
“continuous case”. 

5.102. Definition. Let / : [0, go) —> E be a function, let T c [0, oo) and let 
a < b be real numbers. Define the number of upcrossings Urif- u, b) of / | 
between a and b by 

U T (f,a,b) 

= sup {m : there exist t\ < t 2 < • • • < fam, tj e T f (fak-i) ^ a, / (fak) ^ b}. 

(5.168) 
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5.103. Lemma. Let D by the set of non-negative dyadic numbers and let f : D —> 
M be a function, which is bounded on D n [0, n ] for all n g N. Assume that, 
for all n g N and for all real numbers a < b, with a and b (dyadic) rational, 
the number of upcrossings Uc n [ 0)n j(f,a,b) of f is finite. Then the following 
assertions are true: 

(a) For every tel the following left and right limits exist: 

lim f(s ) and lim f(s); (5.169) 

s]t,seD s[t,seD 

(b) Define the function g by g(t) = lim f(s). Then g is right-continuous 

sit.seD 

and for every t > 0 the left limit lim^ g(s ) exists. 

Proof, (a) We will show that the limit lim s j tjS> t f(s) exists. Since the func¬ 
tion / is bounded it suffices to prove that lim inf s |f jS>t f(s) = limsup^ >t f(s). 
Assume that this not the case. Then there exist dyadic rational numbers a and 
b such that liminf S it,s>tf{ s ) < a < b < limsup s ; ts>t /(s). This means that 
there exists s 0 > t, s 0 e D, with f(s 0 ) > b. There also exists Si < s 0 , Si > t, 
Si e D , such that /(si) < a. In general we obtain t < s 2 fe-i < s 2 *,_ 2 , S 2 k-i e D, 
for which /(s 2 fc-i) < a and we obtain t < s 2 k < s 2 fe-i 5 s 2 k e D, with /(s 2 fc) > b. 
For m g N we write t 2m = so, t 2m -i = si, • • •, t 2 = s 2m -i, h = s 2m - 1 - Pick 
n > so- Then we have Ui)n[o,n](f, a, b) ^ m. Since m e N is arbitrary it follows 
that U jT)n[o,n] (/• a i b) = co. So we obtain a contradiction. The existence of the 
left limit can be treated similarly. 

(b) Put g(t) = lim s it, s >t,se d f(s). By (a) this function is well defined. Since, 
for every n e N, the function / is bounded on the set D n [0, n] the function 
g possesses this property as well. Let now (t n : n e N) be a sequence that 
decreases to t and for which t n > t for all n e N. We will prove lim„^, x , g(t n ) = 
g(t). Then this shows that g is right-continuous at t. Assume lim infi^oo g(t n ) < 
g(t). This will lead to a contradiction. By passing to a subsequence, which we 
call again (t n : n e N), we may suppose that liminf n ^oo g(t n ) = lim n ^oo g(t n ) 
and that there are numbers a and b e D such that for all n e N, g(t n ) < 
a < b < g(t). Then pick so > to such that /(so) < this possible, because 
g(to) < a. The pick So > to > s i > t in such a way that f(si) > b: this is 
possible, because g{t) > b. Then choose t n2 , Si > t n2 > t, with g(t n2 ) < a. Then 
there exists Si > s 2 > t n2 , such that f(s 2 ) < a. This is so because g(t n2 ) < a. 
This procedure can be continued. Like in (a) we arrive at t/D n [o,n](/> a , b) = go, 
for a certain nN, n > t. This is a contradiction. But then it follows that 
lim inf n ^oo ^(tn) 5 s g{t). In the same fashion we see that limsup^^ g(t n ) < 
git). Consequently, gift) = lim n ^.oo g(t n ). In order to prove the existence of the 
left limit of the function g at t, we choose a sequence (t n : n e N), that increases 
to t, and which has the property that t n < t for all n g N. Assuming that 
liminf^oo g(t n ) < limsup^^, g(t n ), then, as above, we arrive at the conclusion 
that, for certain dyadic numbers a < b, for which lim inf r! ^ x g(t n ) < a < b < 
limsup^QQ g(t n ), the number of upcrossings of the function / on the interval 
D n [0, n] with n > t is infinite. 
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This completes the proof of Lemma 5.103. □ 

5.104. Theorem (Doob’s optional time theorem for sub-martingales). Let 

(X(j) : j £ N) 

be a sub-martingale relative to the filtration (T n :neN), and let T ^ S be 
stopping times. Suppose that E[|X(T)|] < go and also E[|X(S')|] < go. If, 
additionally, lim E [X (m) : T ^ m ^ S] = 0, then X(S) is measurable for the 

ra—>00 

a-field 3s and the inequality E [X(T) | Tg] ^ X(S) holds P -almost surely. 

PROOF. Let A be an event in 30,’. For every j, j ^ 1, and for every I e N, 
I ^ 0, the event A n {T ^ £ + j} n {S = 1} n A then belongs to the cr-held 
Ai+j-i. To see this, observe that the event {T > k\ = fi\ {T ^ k — 1} belongs 
to “Jk-i■ Since 

(X(min(T, m)) — .^(mh^S 1 , m))) 1 a 

m m—i 

= L L (X(t + j ) - X(£ + j - 1)) 1 

t= 0 3 =1 

it follows that 

E ((X(min(T, m)) — X(min(S', m))) 1^) 

m m—i 

= L L ® + J) — X(£ + j ~ 1)) 1 {T^e+j}n{S=e}nA) ■ 

e=o j =1 

Hence, E ((X(min(T, m)) — X(min(iS, m))) 1^) ^ 0. Since, in addition, 

E (X(T) - X(S) - X (min(T, m)) + X(min (S, m))) 

= E (X(T) - X(S) :S^m) + E (X(T) - X(m) : T > m > S ), 

the claim in Theorem 5.104 follows. □ 

5.105. Proposition. Let (X(n) : n e N) be a (sub-)martingale relative to the 
discrete filtration (3 n :neN). 

(a) Let H = ( H(n ) be a positive bounded process with the 

property that H n is measurable for the a-field T n _i. Define the process 
(Y (n) : n e N) by 

n 

F(0) = X(0), Y(n) = X(0) + J] H(k) (X(k) - X(k - 1)), n> 1. 

k =1 

Then the process Y is a (sub-)martingale. By putting H(n) = l{ n «:T}> 
where T is a stopping time we see that process 

X T := (X(min(7») : n e N) 

is a (sub-)martingale. 
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(b) Let S and T be a pair of bounded stopping times such that 0 ^ S < T. 
Then 

X(S) < E (X(T) | £Fg) , P -almost surely, (5.170) 

and if X is a martingale, then there is an equality in (5.170). 

Moreover, an adapted and integrable process X is a martingale if and only if 
E (X(T)) = E (X(S)) for each pair of bounded stopping times S and T for which 
S < T. 

Proof, (a) The first assertion in (a) is easy to see. To understand the sec¬ 
ond assertion we observe that l{i>n} = 1 — l{r<n-i} is measurable for the a-field 
T n _i and we notice that X(0)+2fc = i (X(k) — X(k — 1)) = X(min(T, n)). 

This proves assertion (a) in Proposition 5.105. 

(b) De inequality X(S) < E (X (T) | Tg), P-almost surely was already proved in 
Theorem 5.104 and can be obtained from (a) by putting H{n ) = l{r^ n } ~ 

If we use the equality E(X(5 ,B )) = E(X(T B )) for de times S B = SIb + M 1 b c 
and T B = TIb + M1b<=, where B belongs to “Js an( l where M ^ T ^ S, then 
we get 

E (X(T)1 b + X(M) l B c ) = E (X{S) 1 B + X(M) l B c ). 

But, then it follows that E(X(T)1 S ) = E(X(5')ls) for all B e Is and hence 

X(S)=E(X(T)\? s ). 

The proof of Proposition 5.105 is now complete. □ 
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5.106. Theorem (Doob-Meyer decomposition for discrete sub-martingales). 
Let (X(j) : j e N) be a sub-martingale. Then there exists a unique martin¬ 
gale M = (. M(k ) : k 6 N) together with a unique predictable increasing process 
A = ( A(k ) : k eN), with A(0) = 0, such that X(k) = M(k) + A(k), for fceN. 

5.107. Remark. This theorem is, in an appropriate form, also true for sub¬ 
martingales X of de form X = (X(t) : t ^ 0) (continuous time). A process 
A = ( A(k ) : k e N) is called predictable, if A(k) is measurable for 3q ; _i, and 
this for every k e N. 


PROOF. Existence Define the process A by A(0) = 0 and 

k 

A(k) - 2 E PO') - - 1) | Sy-i) • 

3 = 1 

Define the process M by M(k ) = X(k) — A(k). Then the process M is a 
martingale and the process A is increasing (i.e. non-decreasing) and predictable. 
Moreover, the equality X = M + A holds. 

Uniqueness Let the process X be such that X = M + A where M is a martingale 
and where A is predictable and increasing. In addition, suppose that A(0) = 0. 
Then the equalities 

k 

2 E(XU)-X(j-l) I 

3 = 1 

k k 

= 2 E (MU) - M(j - 1) | J,--,) + £ E (A(j) - A(j - 1) | 

3 = 1 3 = 1 

= 2 (m - mj - 1)) = A(k), 

3 = 1 

hold for k ^ 1. So the proof of Theorem 5.106 is complete now. □ 


5.108. Theorem. Let X = (X(k) : 1 ^ k ^ N) be a sub-martingale. Then the 
following inequality holds: 


E (U {1 


(x _ h \\ - E[max(X(iV)-o,0)] 

,N}{X,a,b)) ^ -—- 


PROOF. For a proof we refer the reader to Proposition 3.71 of Chapter 3. 
Notice that, with X the process max(X — o, 0) is also a sub-martingale. □ 


5.109. Theorem. Let X = (X(t) : t ^ 0) be a sub-martingale for the filtration 
(9y : t ^ 0) . For a < b the inequality 


E (f7 Dn [ 0; 7v] ( X , a, 6)) 


< 


E [max (A (IV) 
b — a 


o,0)] 


holds . 
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Z 

PROOF. Write D n = •— and define U n by U n = UD n n\om(X, a,b). The 

2 n L ’ J 

sequence U n then increases to UDn[o,N](X, a, b). So it follows that 


E (U D n[ o,n]) = lim E (U n ) 


< 


E [max(X(AT) — a, 0)] 


b — a 


This completes the proof of Theorem 5.109. 


□ 


The following theorem contains Doob’s maximal inequalities for submartingales. 

5.110. Theorem. Let X = (X(0), ..., X(n)) be a sub-martingale. Then the 
following maximal inequalities of Doob hold: 

(a) 

(b) 

(c) 


*S£n Xj >X ) ^ ^E(X(n)); 

max \XA > A ) < ^E (5 |X(n)| - 2X(0)); 
KjsSn J J A 


and if X is a martingale 


P( muftis A) <l{E(|X(n)|)}. 


PROOF. We begin with a proof of (c). Consider the mutually disjoint events 


A 0 = {|X 0 | > A}, and A k := ||X(fc)| > A, q max^ \X(J)\ < A 
1 < k < n. Then {Jl =Q A k = {max 0 ^^ n |X(j)| ^ A}. Therefore 

P 


3 = 0 


max I X(j)\ ^ A 

o^j^n v ' ' 

and so, using the martingale property 

P(^) = E(UJ<iE[l /lt |X(k)|] 

(martingale property) 

= 1e[U, |A»| | 3>] s; iE[U t E(|A(n)| | Si)] = 1 e[U, |A(n) 


A 


A 


(5.171) 


By summing over k in (5.171)we get (c). 


(a) The proof of (a) follows almost the same lines, except that in the definitions 
of the events A k the absolute value signs have to be omitted. 

(b) For the proof of this assertion we employ the Doob-Meyer decomposition 
theorem (Theorem 5.106). Write X = M + A with M a martingale, and A 
(predictable) increasing process. We let M(0) = X(0). Then we see 

P (jnax |X(j)| ^ a) < P (jnax \M(j)\ ^ —) + P (A(n) ^ — 
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(by (c)) 

< ^E \M(n)\ + ^E (A n ) 

< ^E (\M(n)\ - M(n ) + X(n)) 

A 

< ^E (2 |X(n)| + 2 A(n) + X(n)) 

A 

< (5|X(n)| -2A(0)). 

A 

This proves assertion (b). 

The proof of Theorem 5.110 is complete now. □ 

5.111. Lemma. Let (A n : ne N) be a sequence of a -fields decreasing to the a- 
field A^. So that A n+ 1 c A n , n e N, and Aoo = ^et (f n : n e N) u 

{/oo} a sequence of stochastic variables with the following properties: 

(i) f n is A n -measurable, neN, and fee is A^-measurable; 

(ii) f m < E (f n | A m ), /or all m^n, and /<» < E (f n \ A x ), n e N; 

(hi) lim^oo E (/ n ) = E (/oo). 

Then the sequence (f n '■ n e N) uniformly integrable. 


Proof. For m = 1, 2,. .., oo we have 

/m < E (/i | A m ) and max(/ m , 0) ^ E (max(/ m , 0) | A m ) . 

From this it follows that the sequence (max(/ m , 0) : 1 < m < oo) is dominated 
by an integrable function (in fact by E (max(/i, 0))). So it follows that this se¬ 
quence is uniformly integrable. The fact that the sequence (max(-/„, 0) : n e N) 
is also uniformly integrable, is much less trivial. To this end we consider 

- AP (f n < -A) >E(f n :fn< -A) = E(/ n ) — E (f n :f n > -A) 

> E (E (fn | A 0 o)) - E (E (/i | A n ) : f n ^ -A) 
>E(f ao )-E(f 1 :f n >-\) 

> E(/oo) - E(max(/i,0)). (5.172) 

From (5.172) it follows that 

AP (fn < -A) < E (max(/i, 0) + max(-/ 00 , 0)) < oo. 

Then choose e > 0 and mo in such a way that E (f mo ) < E (/oo) + e. For n ^ mo 
we then see E(/ mo ) < E (f n ) + e. Hence, E (f n ) A E(/ mo ) — e. Then choose 
8 > 0 such that P(A) < 8 implies E (|/*.| : A) ^ e for k = 1,..., mo- After that 

choose Ao so large that P (fk < A) ^ 8 for all k e N and for all A ^ Ao- For 

1 < k < mo we then get E (\fk\ : fk ^ —A) < e, A 5= Ao- For k ^ mo we see 

E (f k : f k < -A) = E (f k ) - E (f k : f k > A) (5.173) 

> E (f mo ) - E (E (f mo | A k ) ■ fk > -A) - e > E (f mo ■ fk < -A) - e. 
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By (5.173) we obtain 

E (|/jfc| : f k < -A) = -E (f k : f k < -A) 

< |E (fmo ■ fk < —A)| + e < 2e 

for a certain A > 0. Thus we see that the sequence (max(-/„, 0) : n e N) is also 
uniformly integrable. This yields the desired result in Lemma 5.111. □ 

5.112. Theorem. Let, relative to the right-continuous filtration (Tf : t ^ 0), 
the process X be a sub-martingale. Suppose that To contains the zero-sets, and 
that the function t >-» E(X(£)) is right-continuous. Then there exists a process 
Y = (Y(t) : t ^ 0) which is cadlag is which cannot be distinguished from X. So 
for every t ^ 0 the equality Y(t ) = X(t) holds P -almost surely. 

PROOF. There exists an event Cl' in Cl, with P(fl') = 1, such that on Cl' the 
following claims hold: 

sup |X(f)| < go, for all n e N; 

teDn\0,n ] 

UDn[o,n](X, a, b ) < go, or all n e N and for all a < b, a and b rational. 

Since P (IT) = 1 we see that Cl' belongs to To and, hence Cl' belongs to Tt for 
all t ^ 0. On Cl' we define the process Y = (Y(t)]t ^ 0) as follows: Y(t) = 
lim s |f ]S> t iS6 D X(s). Then Y(t) is measurable for all cr-fields T„ with u > t. By the 
right continuity of the filtration (T t : t ^ 0) we then see that Y(t) is measurable 
for the <7-field Tt- Then take t = lim n ^, x , s n , where s n { t, and where, for every 
n e N, s n belongs to D. Then Y{t) = lim^oo X(s n ) in probability. Then 
apply Lemma 5.111 to conclude that the sequence ( X(s n ) : n e N) is uniformly 
integrable, and hence Y(t) = L 1 — lim n ^oo X(s n ). We may apply Lemma 5.111. 
for f n := X(s n ), /oo = Y{t), A <*> = Tt and A n = T Sn . Then notice that 
X(t) < E (X(s n ) | Tt), P-almost surely. By L 1 -convergence, from the latter we 
see that X(t) ^ E (Y(t) | Tt) and thus X(t) ^ Y(t) P-almost surely. Since, 
in addition, E(T(t)) = lim n ^ooE(W(s n )) = E(X(t)), the equality Y(t) = X(t) 
follows P-almost surely. 

This completes the proof of Theorem 5.112. □ 

5.113. Theorem. LetX = (X(t) :t^0)bea sub-martingale with property that 
supE [X(t) + ] < go. The following assertions hold true. 

t:?0 

(a) The limit X(co) := lim^o^eD X(s) exists P -almost surely. 

(b) If X is a cadlag process, then the limit X(cc) := lim^oo exists P- almost 
surely. 

(c) If, in addition, the process (X(t) + : t ^ 0) is uniformly integrable, then 
the inequality X(t) < E (X(oo) | Tt) holds. 

PROOF, (a) From the maximal inequality of Doob it follows that, for A > 0, 
the following inequality holds: 

AP ( sup \X(t)\ > A ] ^ 10E (X(n) + + X(0)~) . (5.174) 

\ teDn[0,n] I 
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By letting n tend to go in (5.174) we obtain 

AP (sup|A(f)| > A ) < 10supE(A(n) + + X(0)“) , 

\ teD J n 

and hence, su.p teD |X(i)| < go P-almost surely. In the same manner we see 

E (U Dn[ 0 , 00 )(X, a, b )) < sup E (X(T ^ ~ a)+ . (5.175) 

n 0 CL 

From (5.175) we see that Uc n [ 0tOD )(X,a,b) < go P-almost surely. As we proved 
regularity starting from (5.175) and (5.174) (in fact from their consequences), 
we now obtain that X(oo) := lim s _K X)iS6 .D X(s) exists. 

(b) If X is cadlag, then, like in the proof of the regularity, the limit X(oo) = 
lim s ^oo X (s) exists. 
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* 
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A 
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P 
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(c) Since the process (X(t) + : t P 0) is uniformly integrable, it also follows that 
the process t >—>■ max(X(f), a) = ( X(t ) — a) + + a is uniformly integrable as well. 
So, for Ae “J t and for u > t, the (in-)equalities 


[ max(X(f), a) dP < lim [ max(X(u), a) dP 

Ja u ^°° Ja 

= I lim max(X(n), a) dP = ( max(77(co), a) dP 

Ja Ja 


hold true. Since 


I X(oo)+dP = lim I X(uY 

J u^ccj 


dP < oo 


we see that X(oo) + belongs to L 1 (Q, T, P). But then we get 



lim I max(X(t), a) dP < lim I max(X(co), a) dP 

00 a^—cc J ^ 

f X(oo)dP. (5.176) 

Ja 


From (5.176) the inequality X(t) < E(X(oo) | 3q) follows. This proves item 
(c). The proof of Theorem 5.113 is now complete. □ 


5.114. Theorem. Let X = (X(t) : t ^ 0) be a sub-martingale with the property 
that the process ( X(t) + : t ^ 0) is uniformly integrable. In addition, suppose that 
X is cadlag.If S and T are a pair of stopping times such that 0 ^ S < T < co, 
then the following inequality holds: X(S) ^ E ( X(T ) | Ts). 


Proof. Put S n = 2~ n \2 n T] and, similarly, T n = 2~ n \2 n T], Then the stop¬ 
ping times S n and T n attain exclusively discrete values (in fact they take their 
values in 2 -n N). It is true that S n [ S (if n —» co) and the same is true for the 
sequence (T n :neN). Moreover, S n ^ T m for n ^ m. From Doob’s theorem 
about discrete optional stopping times it follows that 

X{S n ) < E (X(T m ) | ? Sn ) , X(S n ) E (X(oo) | T Sn ) , 

X(S n )<E(X(co)|T 5n ). 


From this it follows that the processes ( X(S n ) + :neN) and ( X(T n ) + :neN) 
are uniformly integrable. For all n, m in N, n ^ rn, the following inequality 
holds for Ae F S - 


| max(X(S' n ), a) dF < 

Ja 

[ max(X(T m ),a) dP] 

Ja 

(5.177) 

(let n tend to 00 in (5.177) to obtain) 



j max(T(5), a) dP < 

Ja 

j ma x(X(T m ),a) dP; 

Ja 

(5.178) 

(in (5.178) let m tend to co to obtain) 



J max(X(5'), a) dP < 

Ja 

[ max(X(T), a) dP; 

Ja 

(5.179) 
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(in (5.179) let a tend to — go to obtain) 

f X(S)dP ^ f X(T)dF, (5.180) 

JA JA 

and that lim rwx X(S n ) = X(S) and that the same is true for the stopping time 
T. By (5.180) we then see X(S) < E (X(T) | 3$). This completes the proof of 
Theorem 5.114. □ 

5.115. Corollary. Let X = (X(s) : 0 < s ^ t) be a cadlag martingale and let 
O^S^T^t be two stopping times. The following equalities are true: 

X{S) = E (X(T) | T s ) and E (X(T)) = E (X(oo)) = E(X(0)). 

PROOF. The proof is left as an exercise for the reader. Among other things 
notice that the martingale ( X (s) : 0 ^ s C t) is uniformly integrable. □ 

5.116. Corollary. Let X be a cadlag martingale in L 1 (O, T, P) which is uni¬ 
formly integrable. Then the limit X(co) := lim^oo X(t) exists P -almost surely, 
and if S and T are stopping times such that 0 < S < T < go, then the following 
equalities hold: 

X(S) = E (X(T) | T 5 ) and E (X(T)) = E (X(oo)) = E(X(0)). 

PROOF. The proof of this corollary is left as an exercise for the reader. 
Observe that for n e N fixed the martingale (X(min(n, t) ) : t ^ 0) is uniformly 
integrable. □ 

In what follows the process X : [0, go) x fl —> E 1 " is a process with values in RC 
where u may be 1. 

5.117. Definition. Let A be a stochastic process, which is adapted to the 
filtration (% : t ^ (1). The process X is said to be a Levy process if X possesses 
the following properties: 

(a) For all s < t the variable X(t) — X(s) is independent of 

(b) For all s ^ t the variable X(t) — X(s) has the same distribution as 

x(t- s y, 

(c) For all t ^ 0 and for every sequence (t n : n e N) in [0, go) that converges 
to t, the limit lim n ^oo A(t n ) = X exists in P-law (or in P-measure). 
Sometimes this is denoted by P-lim n ^ Xj X (t n ) = X(t). 

5.118. Theorem. Let X be a stochastic process, which is adapted to the filtration 
(5ft : t ^ 0), and which takes it values in RC The following assertions are true: 

(a) Let X be a Levy-process. Define for t ^ 0 the probability measure pL t as 
being the distribution of X(t). So pfiB) = P(X(f) e B), where B is a 
Borel subset of RC Then the family {p t : t ^ 0} is a vaguely continuous 
semigroup of probability measures. 

(b) Conversely, let {p t ■ t ^ 0} be a vaguely continuous semigroup of prob¬ 
ability measures on RC Then there exists a Levy-process 

X = {X(t) :t> 0} 
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with cadlag paths such that fo(B) = P (X(t) e B) for all Borel subsets 
B of My. 


PROOF, (a) Define for t ^ 0 the characteristic function f t of X(t) as being 
the Fourier transform of the P-distribution of X(t). So that 

/,(£)-E (exp (-i «,*(()»), (£»'■ 

Since, for t, se [0, oo), X(s + t) = X(s + t) — X(s) + X(s), since, in addition, 
X(s + t) — X(s) is independent of X(t), and because X(s + t) — X(s) possesses 
the same distribution as X(t) we infer 

fs+t(Q = E (exp (—* <£, *(* + *)») 

= E (exp (—i (£,X(s + t) - X(s)}) exp (-z (£,X(s)))) 

= E (exp (—i (£,X(s + t) — X(s)))) E (exp (—i (f,X(s)))) 

= E (exp (-* <f, X(t)))) E (exp (-* <£, X(s)))) 

= MOfsiO- (5.181) 

Since X(0) and X(0) — X(0) = 0 have the same distribution we see /o(£) = 1. 
Since P-lim^o X(u) = X (0) we see, for example by Theorem 5.85 in combination 
with the implication (1) =^> (9) of Theorem 5.43, that lim s jo/s(£) = /o(£) = 1. 
From (5.181) it then follows that 

lim/ t (0 - f s (0 = hm f a (£) (, fts(0 ~ /o(0) = 0 

t[s tis 

for all s ^ 0. Because, by applying equality (5.181) repeatedly, we see / t (£) = 
(/f 2 -n(^)) 2 . In addition we have lim s |o fs(0 = 1. So it follows that for no value 
of t e [0, go) the function / t (£) vanishes for any £. Since, for t < s, /t(£) — /«(£) = 
(/o(0 - fs-tiO) /*(£)> if also follows that lim tTa / t (f) = / s (f), for s > 0. From 
the previous considerations it follows that the function t >-* /*(£), t e [0,oo), 
is a continuous function, which satisfies the relation f s+t (0 = for 

all s, t > 0 and this for all £ e M !/ . Furthermore, we define the family of 
measures {/i t : t ^ 0} as being the P-distributions of the Levy-process X. So 
that m{B) = P (X(t) e B), B Borel subset of M". From the previous arguments 
it then follows that 

hs+t(0 = fs+t(0 = fs(0M0 = MOMO (5.182) 

and that lim^o hs(0 = 1- So that the family {/r t : t ^ 0} is a vaguely continuous 
semigroup of probability measures on W'. By Theorem 5.31 there then exists a 
continuous negative-definite function ip such that ft(f) = fit{€) = ex P (— 

(b) Define (f2,T, P) as in Proposition 5.36. Likewise we define the state vari¬ 
ables X(t) : as in Proposition 5.36. Let the filtration {fj t : t ^ 0) 

be determined by % = a (X(u) : 0 ^ u ^ t). So the filtration : t ^ 0) is 
the internal history of the process X. Then X is a Levy-process, which pos¬ 
sesses the properties as described in (b). The fact that for t > s the variable 
X(t) — X (.S') is independent of T s was proved in Theorem 5.37. We must show 
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that, for e > 0 fixed, lim s |o P (|-^( s ) — X(0)| > e) = 0. Therefore, notice first 
that P(X(0) = 0) = /xo{0} = 1. Hence, with B(e) = {ier: |x| < e}, we have 

P(|X(s) - X(0)\ > e) = P(| X(s) - X(0)| > e,X(0) = 0) 

= P(|X(s)| > e,X(0) = 0) 

= P(|X( S )|>e) 

= Ms {r'YB(e)} = 1 -Us {B(e)} . (5.183) 

Since the convolution semigroup {/i t : t ^ 0} is vaguely continuous it follows that 
lim n s {B(e)} = 1. From (5.183) we then see that lim^o P (|X(s) — X(0)| > e) = 

s|0 

0. The only problem which is still left, is the fact that the process X is not 
necessarily cadlag. In the following propositions and lemmas we will, among 
other things, resolve this problem. From Theorem 5.121 it follows that the 
process X is also a Levy process for the filtration (St : t ^ 0), where St = SyuN. 
By Theorem 5.123 we then see that the process X possesses a cadlag version. 

The proof of Theorem 5.118 is now complete. □ 
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5.119. Proposition. Suppose 0 < si < • • • < s m and choose t ^ 0. Let X be a 
Levy process for the filtration (+ : t 5= 0), where % = (t {X(u) : 0 ^ u < t}. Let 
{p t : t ^ 0} be the corresponding convolution semigroup and if the corresponding 
negative-definite function. So m(B) = P (X(t) e B ) for all Borel subsets B and 
fiiifi.) = exp (— tip (£)) for all t ^ 0. For £ 1 ,..., in ML the following equalities 
hold: 


E 


ex p ( (?’ X ( t + s j)) 


3 = 1 


= E 


exp ( + s i)) 

3 = 1 



exp 





exp 


3 = 1 




(5.184) 


Here we write ++ = ^ s and Sq = 0. 


PROOF. We apply induction with respect to m. We begin with the condi¬ 
tioning on %. For ra = lwe have 

E[exp (-i(e,X(t + Sl ))) |+] 

= E[exp(-z(£ 1 ,X(t + s 1 ) -X(t))) I B t ]exp (-i(^,X(t))) 

(X(t + Si) — X(t) does not depend on +) 

= E [exp (-* (e 1 , X(t + 3l ) - X(t)»] exp (-* {?, X(t))) 


(X(t + Si) — X(t) has the same distribution as X(si)) 


= E [exp (-* (C 1 , X (si)))] exp (-i (f\X(t))) 

= Ps i (^)exp(-i (£\X(t))) 

= exp (siip (C 1 )) exp (-* (^,X(t))) 

= exp (- (si - s 0 )ip(^)) exp (-* (^,X(t))) . (5.185) 


Notice that (5.185) is the same as the equality in (5.184) for m = 1. Suppose now 
that we already know (5.184) for every t ^ 0, for every m-tuple si < ■ ■ ■ < s m 
and for every m-tuple + ,... in M !/ . We keep working with the original 
filtration (3y : t ^ 0). For s m+ i > s m and for £ TO+1 e E" we then see 


E 


771 + 1 


ex P -* 2 & + s i)) I ^ 


3 = 1 


= E 


exp ( -iYi(¥, x (t + Sj)) 

3 = 1 


E [exp (—* (fi m+1 ,X ft + s m +i))) \% +Sm ] | + 
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(employ (5.185) for t + s m+ i instead of t) 

/ m 

= E expf-ig(^,X(t + Sj -)> 

exp (-i (£ m+1 , X (t + s m +i))) exp (- (s m+1 - s m ) ip (£ m+1 )) | % 
(induction hypothesis) 


( I m+1 \ \ / m+1 /m+1 N 

-* y2 ex p (- 2 («i - s i-i)(^2 ^ 11 • ( 5 - 186 ) 

But (5.186) is the same as (5.184) with m replaced by m + 1. Next we look 
at the situation for the filtration {3y+ : t ^ 0} which is closed from the right. 
Without loss of generality we may assume that si > 0. In case si = 0 we have 
indeed 


E 


exp -z2(e,X(t + Si )) ) \+ + 


3 = 1 


ex P (—* (^,X(t)))E 


ex P [ -i 2 (£ 3 ’ x (t + Sj))J | %+ 

So assume that Si > 0 and choose n e N such that Si > n~ l . Then we see, by 
(5.184) for t + n~ l instead of t, 


E 


exp ( ~i 2 , X (t + sj)) j | 3 r t + 


j =i 


(5.187) 


= E 


E 


exp ( -i 2 (t + sj)) j | + 

i =i 


t+n 


% 


t+ 


(write so = n 1 in what follows) 
= E 


ex P | ~i (2^’ X {t + n X ) 

ex p i - 2 _ -N-i) ^ (2 ) ) I ^ 

3 = 1 \k=j 


t+ 


In (5.187) we let n tend to oo. Apparently it follows that 


E 


exp — i 2 (e, *(* + *;)) | ?t+ 


3 = 1 


= E 




exp 


-*(2e,*(t))) e xp(-2^ 2 

\k=j 


+ 


t+ 


\J = 1 


i=i 
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= exp \ -i exp X ( s i “ yEl ^ 

From (5.188) it then follows that (5.184) holds for the filtration {fF t+ 
which is closed from the right. 


(5.188) 
:t> 0} 


This completes the proof of Proposition 5.119. 


□ 


5.120. Corollary. Let the assumptions and hypotheses be as in Proposition 
5.119. For every bounded complex-valued random variable Y, that is measurable 
for the a-field a{X(u ) : u ^ 0} the following equality holds P-almost surely de 
equality: 


E [Y | %] = E [Y | T t+ ] . 


(5.189) 


Proof. Put Y = exp Xijli (£ 7 ) By Proposition 5.119 we see 

that for all such random variables Y the equality in (5.189) holds, provided 
that Sj ^ t, for 1 ^ j Y m. By splitting and using the standard properties 
of a conditional expectation we see that the restriction ,sy ^ t is superfluous. 
In other words the equality in (5.189) holds for all variables Y of the form 

Y = exp qli i X( s j))^ where all Sj belong to [0, go) and where all f J 
are members of R". Let Yq be a bounded complex-valued random variable, which 
belongs to the linear span of variables of the form exp (—i X!)=i {A. Xisjf'j. 
Then consider the vector space flf(F 0 ) defined by 

X(Y 0 ) 

= [Y : Cl —> C : Y is bounded and measurable for the rr-field a {X(u) : u ^ 0} 
and E (Flo | T t ) = E (YY 0 | 9y + )} . 

By employing Lemma 5.100 or, even better, the monotone class theorem we see 
that flt(lo) contains all complex-valued bounded random variables, which are 
measurable for the cx-field a \X(u) : u Y 0}. Among others we may put Y 0 = 1, 
and the claim in Corollary 5.120 follows. □ 

5.121. Theorem. LetX = {X(t) : t ^ 0} be a Levy-process. Let‘K = {‘K t : t^ 0} 
be the internal history of the process X. Let AT be the null sets in TC^. Then 
the filtration 9, with G t = cr {flf t u IN’}, is continuous from the right. 

PROOF. Let A e S t+ . By Corollary 5.120 we have 1a = E (l^ | T t ). Let 
B e T t be such that 1 B = E (l^ | Sq), P-almost surely. Then P (AAB) = 0. 
Since A = BA (AAB), we see that A in fact belongs to 9t- Cl 

5.122. Lemma. Let ( x n :neN) be a sequence of vectors in IP with the property 
that the sequence (exp (—i (£, x n )) :neN) converges for almost all (eT. Then 
the sequence ( x n : n e N) converges. 

Proof. Fix 1 Y j Y v, and let U ] be a vector valued stochastic variable 
which is zero for the coordinates k =f= j and with the property that U:{ is uni¬ 
formly distributed on the interval [0,1]. The following inequalities are true for 
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0 < S < 1: 


2Re E (l — exp (—i {Uf x n — x m ))) = E 11 — exp (—i (JJf x n — x m )) | 2 



>45 2 F{\(U j ,x n -x m )\ >5 tt} 


= 4S 2 max 


Sn 

%n,j %m,j 



Since this inequality holds for every 0 < S < 1, it follows that the j -th coordinate 
(x n j :neN) of the sequence (x n : n e N) converges. This holds for 1 < j ^ 
v. Hence, the limit lim„^ lX , x n exists. This makes the proof of Lemma 5.122 
complete. □ 


Among other things, in Theorem 5.123 the proof of item (b) in 5.118 is com¬ 
pleted. 

5.123. Theorem. Let (X(t), 3q) tSs0 be a Levy-process. Suppose that the filtration 
(T t : t ^ 0) is right-continuous, and that To contains the null sets. Then there 
exists a cadlag modification of X = (X(t) : t ^ 0). 


> Apply now 



REDEFINE YOUR FUTURE 
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PROGRAM 2015 
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The result in Theorem 5.123 can be applied in case we take the internal history, 
completed with null sets, as the filtration (3y) t>0 . 

Proof of Theorem 5.123. We will make use of the following product 
set: 

E = C Q+ = |(«t)t 6 Q + : a t e C for all t e Q+j , 
endowed with the product-cr-field £ = ® t6 Q + 25(C). Put 

D = j(cp) t6 Q + : : [0, oo) —► C cadlag with ip(t) = a t for all t e Q+|. 

Upon writing p> as the pointwise limit ip{t) = lim n ^oo ip n (t), where 

00 

<Pn(t) = Y j ( P{( k + l)2 _n ) l[fc2-»,(fc+l)2-»)(<), 
fc =0 

it can be proved that D belongs to the cx-field £. Consider the mapping: 

$ : IT x Q — E 


dehned by 


$(£, u) = exp (-* (f, X(t))) =: a t . (5.190) 

The mapping $ is measurable for the u-fields Too and £. As a consequence 
A := $ _1 (D) belongs to the cx-field 25So for every pair (£, uj) e M" x 
there exists a cadlag function / : [0, oo) —*• C with the property that the equality 
fit) = exp (—i (£, X(t))) holds for all t e Q + . Now let the negative-definite 
function corresponding to the process X be given by if. Then the process 
1 1 —> exp (—i (£,X(t)) + tip(£)) is a martingale. This is so, because, for 0 < s < t, 
we have the following equalities: 

E (exp (-i <£,*(*)>+ #(£)) |3F S ) 

= E (exp (-*(£, X(t) - X(s)) + (t - s)ip(^)) | 3"*) exp (-?*(£, X(s)) + s^(f)) 
(X(t) — X(s) does not depend on T s ) 

= E (exp (-* <£, X(t) - X(s)) + (t- s)m)) exp H <£, *(*)> + s^)) 

(X(t) — X(s) heeft dezelfde distribution als X(t — s)) 

= E (exp (—i (£,X(t - s)) + (t- s)ip(g))) exp (-* (£,X(s)) + sipfc)) 
(definition of 

= exp(-i (f,X(s)) + sip(£)). (5.191) 

From martingale theory it follows that there exists a cadlag version AU = 
(M^(f) : t ^ 0) of the martingale t *—* exp (—i (£,X(t)) + £0(0)- By this we 
mean that for every (£,i) e x [0, oo) there exists an event N t ^ with the 
following properties: P (N t ^) = 0 and for uj £ the equality = 
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exp (—i ui)) + tfi (£)) holds. Hence, for every (el" there exists a P-null 

set such that for every t e [0, co) n Q the equality 

M*(t)(u)) = exp (-* (£,X(t,uj)) + t^(f)) 

holds for all uj £ N%. In other words for all the equality: 

P {uj : (f, w) e A} ^ P {X|} = 1 (5.192) 

holds. From (5.192) it then follows that 

0=f dfi I dPl A c = I dP I l A c d£. (5.193) 

Jr*' Jh Jn Jr" 

The equality in (5.193) implies that for P ® A-almost all (oj, £) the function 
t i—► exp (—i (£, X(t, u;))) belongs to D. By Lemma 5.122 we see that P-almost 
surely the following limits exist for all t ^ 0: 

limX(s) and limX(s). 

sit sit 

seQ seQ 

Define the process Y by Y(t) = lim X(s). Then the process Y is cadlag: 

sit, seQ 

see (the proof of) Lemma 5.103 (b). Furthermore, X(t) = lim X(s) (in P- 

sit,seQ 

distributional sense), and thus X(t) = Y(t ) P-almost surely. The proof of 
Theorem 5.123 is now complete. □ 

5.124. Theorem (Dynkin-Hunt). Let (X(t),T t ) t>0 be a cadlag Levy process 
with a right-continuous filtration (9q : t ^ 0). Let T : Cl —> [0, go) be a stopping 
time which is not identically go. So that P {T < go} > 0. On the event {T < go} 
the process Y = {Y(t) : t ^ 0} is defined by Y(t ) = X(t + T) — X(T). 

(a) Under P the process Y has the same distribution as the process X under 

¥. 

(b) The a-fields “Jt and a {Y(s) : s ^ 0} are ¥-independent. 

PROOF, (a) For n e N we write T n = 2 _n [2 n T], Then (T n : n e N) is a 
sequence of stopping times with the following properties: 

(i) {T n < go} = {T < go}; 

(ii) T < T n+l < T n < T + 2~ n , n e N. 

Define the sequence of processes (Y n : n e N) via the formula: 

Y n (t ) = X(t + T n ) — X(T n ) op de event {T n < go} = {T < go} . 

Let now / : (!T) m —► C be a bounded continuous function, let A be an event 
in Tr c y Tn , and let Si < • • • < s m be an increasing sequence of fixed times. 
Then the following equalities hold: 

E [/ (T n ( Sl ), Y n (s 2 ) - Y n ( Sl ),..., Y n (s m ) - Y n (s m -i)) l An{T <oo}] 

00 

= 2 E [ l A n{ Tn =k 2 -n } f {x ( 3l + k2~ n ) - X (k2~ n ) ,... 

k—0 

,X(s m + k2 ~ m ) - X (s m _! + k2 ~ m ))] 
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(X(sj + k2 n ) — X(sj -1 + k2 n ) does not depend on 3 f 8j _ 1 +k2~ n ) 

00 

= Y A ^[An{T n = k2~ n }} 

k =0 

E [/ (X ( Sl + k2~ n ) - X ( k2~ n ) ,...,X(s m + k2 ~ m ) - X (s m _! + k2 ~ m ))] 
(X(sj + k2~ n ) — X(sj -1 + k2~ n ) has the same distribution as X(sj ) — X(sj_i)) 


00 

= ^P[dn{T n = fc2""}] E [/ (X ( Sl ) - X (0),..., X (s m ) - X (s^))] 

k =0 

= P [X n {T n < oo}] E [/ (X ( 5l ) - X (0),..., X (s m ) - X (s^))] 

= P [X n {T < oo}] E [/ (X ( Sl ) - X (0),..., X (s m ) - X (s^))]. (5.194) 


In (5.194) we let n tend to oo. Since the process X is right-continuous it follows 
that lim Y n (t ) = Y(t) P-almost surely on the event {T < oo}. By the continuity 

n—»oo 

of the function / the equality 


E [/ (Y(si), Y(s 2 ) - Y{ Sl ), Y(s m ) - Y(s m _ X )) lAn{T<oo}] 

= P [A n {T < oo}] E [/ (X ( Sl ) - X (0),..., X (s m ) - X (s^))] (5.195) 


follows. By taking, in (5.195), the function / of the form / = /o 0 V m , where 

V m : (E^) m -»• (E^) m is give by V m (x u ..., x m )) = {x u ..., x x +-b x m ) we 

see 

E [f 0 (^(si), • • .,Y(s m )) l^ n{T <oo}] 

= P [X n {T < oo}] E [/ 0 (X( Sl ),..., X(s m ))]. (5.196) 

Here /o : (M 1 ')” 7 —*• C is an arbitrary bounded continuous function. By passing 
to limits (5.196) follows for arbitrary bounded Borel measurable functions /o : 
—»• C. Via the monotone class theorem the assertion in (a) follows. 


(b) By taking the function / of the form / = fo°V m , where V m : (M I ') m —> (M l ') TO 
is given by V m (xi,..., x m )) = (x X) ..., x x -I- x m ) in (5.195), we get 

E [fo {Y (si), . . . , Y(s m )) l^ n {T<oo}] 

= P [X n {T < oo}] E [fo (X( Sl ),..., X(s m ))], (5.197) 

where fo : (M !/ )"' —> C is an arbitrary bounded continuous function. Then 
choose A = fl and divide by P {T < co}. We get 

E [f 0 (Y(s i),..., V(s m ))] = E [fo (X( Sl ),..., X(s m ))]. (5.198) 

Inserting the result in (5.198) into (5.197) entails 

E [fo (Y ( Sl ),..., r(s m )) 1 A ) = E [fo (Y (sx),..., V(s m ))] P(X). (5.199) 

From (5.199) it follows that the a-held 'Jr is independent of the one generated 
by {y(s) : s ^ 0}: for this employ the monotone class theorem. 

This completes the proof of Theorem 5.124. □ 
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7. Markov processes 

Let (Q, T, P) be a probability space and let X, Y and Z be stochastic variables 
on 0 with values in a topological Hausdorff space E. We assume that E is 
locally compact and that E is second countable, or, what is the same, that 
E satisfies the second countability axiom. In other words E has a countable 
basis for its topology. The space E is supplied with the Borel cr-field £ and 
we suppose that the variables X , Y and Z are measurable for the u-fields T 
and £. The symbol F x stands for the image measure on £ of the probability P 
under the mapping X. So P 'x(B) = P (X e B ), B e £. The symbol Pi is a 

probability kernel from 0 to E with the property that 

J P y | x (z, C)F x (dx) = P (Y e C, X e B) 

for all B and C in £. As function of the first variable the probability ker¬ 
nel P | is Px-alrnost surely determined. Putting it differently, the func- 

v x 

tion x i—► P y | x (x,C) is the Radon-Nikodym derivative of the measure B h-> 

P (Y e C, X e B) with respect to the measure B i—► P x(B) = P (X e B). In the 
following proposition we collect some useful formulas for (conditional) proba¬ 
bility kernels. 
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5.125. Proposition. Let (O, T, P), E, X, Y and Z be as described above. Let 
g : E x E —*• C be a bounded measurable function and let B and C belong to £. 

Then the following equalities hold: 


J J g(x,y)P Y \ x (x,dy)F x (dx) = E (g(X,Y )); 

(5.200) 

J P Y l x (x,C)P x (dx) =E(1 C (Y),X e B); 

(5.201) 

P y | x (X,C) = E(l c (yO|ff(X)); 

(5.202) 

J P z|y(y> c ) p y d v ) = F z x ( x > C )’ ^x-almost surely, 

(5.203) 


provided that E (Z | <t(T)) = E (Z | a(X, T)). 


Proof. The equality in (5.201) follows in fact from the definition of IP y | x - 

By choosing the function g of the form g(x,y ) = 1b(x)1c(v) in (5.200) we see 
that (5.200) coincides with (5.201). An arbitrary bounded measurable function 
g can be approximated by linear combinations of functions of the form (x,y) i—> 
lB(x)lc(y)i, with B and C in £. Let g : E —> E be a bounded measurable 
function. Then the following equalities hold: 

E(g(X)F Y \ x (X,C)^j = J^(x)P y | x (x,C')P x (da;)=E(^(X)l c (y)). (5.204) 

From (5.204) the equality in (5.202) follows. Let g : E —*• C be a bounded 
measurable function. Then by, among others, (5.202) the following equalities 
are true: 

J g(x) JP z | Y (y, C)P y \ x (x, dy)P x {dx) 

= E (p z | y (X, C)g(X)^j = E (E (1 C (Z) | a(Y)) g(X)) 

= E (E (l c (Z) | a{X, Y)) g{X)) = E (E (l c (Z)g(X) | a(X, Y))) 

= E (1 c {Z)g(X)) = J g{x)V z \ x (x, C)P x (dx). (5.205) 

From (5.205) the equality in (5.203) follows, and completes the proof of Propo¬ 
sition 5.125. □ 


5.126. Theorem. Let (fl,T, P) and (E,E) be as above. Let X = {X(t) : t ^ 0} 
be a stochastic process with values in the state space E adapted to the filtration 
(T t : t ^ 0) . So every state variable X(t) is a mapping from Cl to E, measurable 
for the a-fields T t and £. In addition, suppose that the family of operators 
{d t ■ t ^ 0} from Cl to Cl satisfies the translation property X(s)oi9 t = X(s+t ) for 
all s and t ^ 0. Then the following assertions are equivalent (for the implication 
(Hi) => (i) it is assumed that = o {X(u) : 0 < u < t)): 

(i) For every C e £ and every s and t ^ 0 the following equality holds: 

E [lc(X(s + t)) | 3y] = E [lc:(A(s + t)) | cr (X(t))] P-almost surely, (5.206) 


400 


Download free eBooks at bookboon.com 


Advanced stochastic processes: Part II 


Some related results 


(ii) For every bounded random variable Y : —*• C, that is measurable for 

3+ and £, and for every t ^ 0 the following equality holds: 

E [y o + | +] = E [y o + | cr (X(t))] P -almost surely; (5.207) 

(iii) For every meN and for all (m + 1)-tuple of bounded Borel measurable 
functions /o, • • •, f m '■ E —*• C the equality: 

E [/ 0 (X(0)) f\ (X( Sl )) ...f m (X(s m ))] (5.208) 

fo +o) fl(xi) ■■■ fm (x m ) 

m+1 times 

P , i dx m )... P , | / (x 0 , dxi)F X (o)(dx 0 ), 

X(s m )|x(s m _i) V m 15 mJ X(si)|jC(0) V u ’ iy AWV u; ’ 

holds for every s\ <■■■< s m in [0, go). 

If the process X is right-continuous, then (i) and (ii) are also equivalent 
with the following assertions: 

(iv) For every bounded Borel measurable function f : E —*• C and for every 
stopping time T : fl —> [0, go] the following equality holds P -almost 
surely on the event {T < go}; 

E [f(X(s + T)) | 3F T ] = E [f(X(s + T))\a (T, X(T))] ; 

(v) For every bounded random variable Y : fl —> C, which is measurable 
for 3+, and for every stopping time T : fl —> [0, go] the equality 

E [Y o id T | J T ] = E [Y o tf T | <J (T, X(T))] (5.209) 

holds P -almost surely on the event {T < oo}. 

If the process X is right-continuous and if as filtration the internal history is 
chosen, then all assertions (i) through (v) are equivalent. 



Proof, (i) => (ii). Upon invoking the monotone class theorem it suffices to 
prove (ii) for functions Y : fl —> C of the form Y = njli/j (X(sj)), where the 
functions fj, 1 U j U rri are bounded and measurable. For m = 1 (i) is clearly 
equivalent with (ii). Next we prove (ii) for Y = Y\q=i fj (X(sj)) starting from 
(ii), but with Y = YlU ^ ( X ( s i))> with 1 < k < m. The equalities below then 
show that (5.207) follows for Y = Y\^=i fj (X(sj)): 


E 


m+1 


3 = 1 

= E 


n/u^fe+*))u. 

( m+1 

n fi w s i +o) i 

3 =i 
m 

n*(vfe+t))E (/m+1 {X(s m+ 1 + t)) | + m+t ) | + 


Sm+t 


% 


= E 


.3 = 1 
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(the equality in (5.207) for Y = f m+1 (X(s m+ i))) 


= E 


n/uxfe+ojE [/m+1 (X(Sm +1 + t)) | G ( X (s m + £))] | T t 


J = 1 


(the equality in (5.207) for Y = YYjLi 9j ( X(sj )), where gj = f } . 1 ^ j ^ rri — 1, 
and where g m (x) = f m (x) S/».+ito)P x( , ii>+1+t) | x( ,„ + « | (*,<%)) 


= E 

= E 

= E 


n/jPffe+‘))E (/m+1 (I(w + 0) I {*m + t)) I a(X(t)) 

.3 = 1 
m 

n/ 3 (^fe+i))E (/m+1 (X(s m+1 +f)) |+ m+t ) |+X(f))| 

.3 = 1 

(m 

11 /?' (^( S i + t)) fm +1 +(^m+l + + 

i=i 


J. 


5m+^ 


+X(f)) 


= E 


n /i ( x (^+/-+i (^(wi+*)) i 

_j= 1 


(5.210) 


Then observe that (5.210) is the same as (5.207), but for T = njlV /j (^( 5 j))- 
This proves the implication (i) => (ii). 


(ii) => (i). This implication follows by putting Y = lc(X(s)). 
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(ii) => (iii). The equality in (5.208) is correct for m = 0 and for m = 1. This is 
a consequence of Proposition 5.125. Again we will apply induction with respect 
to m. We assume that (5.208) is correct for m and for the increasing m-tuple 
si < S 2 < ■ ■ ■ s m . Then we see 


E 


m+1 


n fi 


3=0 


E 


f m+1 


e n ^ i ^ 


J =o 


= E 


= E 


n fi < A fe)) E (/™+i (*(Wi)) I 


_j =0 

m 


Y\f j (X(s j ))E(f m+1 (X(s m+1 )) | a(X(s m ))) 
U=o 


(Proposition 5.125) 


III r> 

n/i /m+1 ( X m+l) 

3=0 J 

= J J • • • J Mxo) ■ ■ ■ fm+l(x m +l)P 


P X( Sm+1 )|x( Sm ) ( X ( S m),dx m+ 1) 


5 m+l )+( s m) 


{Xm ? dXryn-^ i j 


m+2 times 


P x( si )|x( 0 ) ^°’ dxi ) P *(°) • (5.211) 

From the equality in (5.208) for m the equality in (5.200) follows for m + 1 
instead of m. 


(iii) =^> (i). Let C e £ and let s and t > 0. Starting from (iii) we will prove that 
the following equality holds P-almost surely: 

E (1 c (X(s + t)) | +) = E (1 c (X(s + t))\a (X(t))) . (5.212) 

Choose 0 < t\ < ■ ■ ■ < t m = t and choose bounded Borel measurable functions 
fo, ■ ■ ■, fm- The following equality is a consequence of (iii): 


E (fo(Xo) ...f m (X(O) 1 c(X(s + t))) 

• • • J J /o+o) • ■ ■ /m+m+C+ 771 + 1 ) 


m+2 times 


a , (x m ,dx m+ i)P , i , (x m -i,dx m )... 

X(s+t)\X(tm) V m ’ m+ ’ X(t m )|x(t m _i) V 171 15 m> 

(x 0 ,dxi)P X (o){dx 0 ) 


P 


X(ti)|X(0) 

= J J J J 


m+1 times 

p i 

X(s+t) \X(tm) 


(x m ,C)P 


X{tm) | X{tm — 1 ) 

p x(ti )| x( 0) (oco.dx^Fx^dxo) 


(^m—1 1 dXjn^j 
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= E 


/o(A'o) .../•» «(„)) P x(l+t) | x(t) (A(«„), C) 


(5.213) 


The monotone class theorem applies to the effect that (5.212) follows from 
(5.213), provided that the internal history is chosen as filtration. 

(iv) => (v). By the monotone class theorem it suffices to prove (ii) for functions 
F : VL —> C of the form Y = Y\y=i fj ( x ( s j))i w h ere the functions fj, 1 < j < m 
are bounded and measurable. For m = 1 it is clear that (iv) is equivalent 
to (v). We prove (v) for Y = fj ( x ( s j)) starting from (iv), but with 

Y = Yl k j=i fj ( x (•+), for 1 ^ k T rn. The following equalities show that the 
equality (5.209) then follows for Y = YXf=\ fj ( x ( s j)) : 


E 


m+1 


n f> ( x (*j +o) i s i 


.0 = 1 


= E 


E 


' m +1 


E n fj + T)) I I 3i 


J =1 


n/U A fe + r))E [/m+1 (X(s m+1 +T)) | + m+r] | 

.3 = 1 


(apply equality (5.209) for F = / m+ i (X(s m+ i))) 


= E 


n/u A fe+r))E [/m+1 +(^m+i + T)) | a (s m + T,X (s m + T))] | Tj 
J=i 


(use equality (5.209) for F = YYf=\9j (-T(sj)), where gj = fj,l^j^m — 1, 


and where g m (x) = f m (x) $ f m+ i(y)¥ 


(<Sm+7 1 ,X(s m +i+T)) (s m +T,X(s rn +T)) 


(x,dy)) 


= E 


E 


= E 


= E 


n/j(A'fe+T))E [/m+1 (X(s m+1 +T))\a(T,X(s m + T))] 
J = 1 


ff (T,A(T)) 


n/ 3 ( A fe+T))E [/m+1 (X(s m+1 +T)) | + m+T ] \a(T,X(T)) 

.3 = 1 


E 


fl fj ( x ( s j + T)) f m+1 (X(s m+1 + T)) | + 


Sm+T 


.3 = 1 


ff (T, X(T)) 


n A ( A fe + r )) /™+i (A(s™+1 + T)) I X(T)) 
J = 1 


(5.214) 


Then realize that (5.214) is the same as (5.209) for F = HJlV /j ( x ( s j))- This 
proves the implication (iv) => (v). The implication (v) => (iv) is again trivial. 
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(i) =^> (iv). By the fact E satisfies the second countability axiom, and by the 
fact £ is the Borel field it suffices to prove (iv) for functions / 6 Cq(E) instead 
of 1 c (verify this precisely). So we have to show the following equality: 

E [f (X(s + T)) | ? T \ 1 { t<oo } = E [f (X(s + T)) | a(T, X(T))] 1 {T<00} , (5.215) 

for / e Co(E) and for s ^ 0. By employing the right-continuity of paths, it 
suffices to prove (5.215) for the stopping times T n := 2~ n \2 n T], n e N, instead 
of T. The equality for T then follows from those of T n by letting n tend to go. 
For this notice that 0 < T — T n+ i ^ T — T n ^ 2~ n . Choose the event A e T^ . 
Then the event A n {T n = k2~ n } belongs to 3 r k2~ n and the following equalities 
hold: 


E[/ (X(s + T n )),An{T n = k2~ n }] 

= E [E (/ (X(s + T n )) | T fc2 -n) , A n {T n = k2 ~ n }] 


= E 
= E 


J Xk2-"),dv) , A n [T n = *2-}_ 

“ " J / < ! ') P X(. + T.<„))|X(T„(»)) ( X Vn)(u),dy) l(^(T„= t 2-»))M • 

(5.216) 
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We also have 


E [f(X(s + T n )), A n {T n = k2~ n }] 

= E [E (/ (X(s + k2~ n )) | J fc2 -n) , An {T n = k2~ n }] 


(because of (i)) 

= E [E (/ (X(s + k2 ~ n )) | a (X ( k2 ~ n ))) , A n {T n = k2~ n }] 



(X(T n )(u>),dy) l{An{T n =k 2 -n}}(cd) ■ 

(5.217) 


We see that (5.216) and (5.217) are the same. It follows that the assertion in 


(iv) is proved for T n instead of T. By letting n tend to go we then obtain (iv) 
for T (by employing the right-continuity of paths of the process). 


So the proof of Theorem 5.126 is complete now. 


□ 


We continue with some definitions. 

5.127. Definition. Let (12, T, P) be a probability space, and let E be a locally 
compact Hausdorff space with a countable basis for its topology. In addition, let 
(3y : t ^ 0) be a filtration on 12. Let X = {X(t) : t ^ 0} be a process attaining 
values in E. The state space E is equipped with the Borel filed and it is assumed 
that X is an adapted process. Suppose that for every x e E the (sub-)probability 


kernel P i (x, C), C e £, is defined. Here the (sub-)probability kernel 


kernel P 


.Vi* • / j | a r o ’ n ' ... 

’¥ > y\x{ x ,C) possesses the following defining property: 



where B and C are Borel subsets of E and where X and Y are stochastic 
variables with values in E. In addition, it is assumed that there are so-called 


translation operators : 12 —> 12 with the property that X(s)o t) t = X(s + t) for 


all s, t ^ 0. Moreover, by hypothesis the process X is cadlag. We say that the 
process X is a Markov process if for every C e £ and every tY 0 the equality 


E [1 c (X(s + t )) \%]=E [1 c (X(s + t )) | a(X(t))] (5.218) 


is P-almost surely true for all s ^ 0. The process X is called a strong Markov 
process if equality (5.218) also holds for stopping times. More precisely, if for 
every s ^ 0, for every C e £ and for every stopping time T : 12 —> [0, go] the 
equality 


E [lc(X(s + T)) | T r ] = E [l c (X(s + T)) | cr(T, X(T))] 


holds P-almost surely on the event {T < 00 }. If the process X is cadlag is, then 
a Markov process is automatically a strong Markov: see Theorem 5.126. We 
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say that a Markov process X is time homogeneous if for all C e £ and for all s 
and t ^ 0 the equality 


P , i ,Ax,C) = P . , (x,C) (5.219) 

is true for all x e E. In what follows we always suppose that X is a cad- 
lag, time homogeneous Markov process. Furthermore we define the operators 
{ P(t) : t ^ 0} via the formula 

[PM/] M = J f(y) p xw |* (0) M< i ?/)- (5.220) 

Here s ^ 0 and / belongs to Cq(E). Since we have (see equality (5.203) in 
Proposition 5.125) 

J P *(, + o|*(d»' C) %|*(»)^> " P ,(. + «)|.V(»)^' C )’ P ^(»)- almost surely ’ 

(5.221) 

we get, for a time-homogeneous Markov process X the following equalities: 

[PMPM/] (I) = | [PM/] (y)P Jt(s) | x(0| (x, dy) 

“II nz)v xm\xm (v ’ dz)r M,)\xm (x ’ dv) 


(X is time homogeneous) 


11 •fW P .Y(,«) \x M ^ dz '> V x M \x ( o ) ^ d v'l 


(employ equality (5.221)) 


"II /(s)P x(» +i )|x(0)( 1 ' dz ~> = [ p < s + W M 

The cadlag property of X implies lim s 10 [P{s)f] (x) = f{x) for all / e C 0 {E) and 
for all x e E. If P(s)f belongs to Cq(E) for every f e Cq(E) and for every s ^ 0, 
then the family \P(t) : t E 0} apparently constitutes a Feller semigroup. Put 
P(s ,x,C) = P^ )lx(o) ( x ,C), s ^ 0, x e E, C e £. Let the expectation values 

of Ea;(y), x e E, Y = njli fj (X (sj)), Si < s 2 < ... < s m , be determined by 
the formula: 




\3 = 1 


r> p m 

J J WA^P^x, 


dx \)... P (s TO s m _i, x m —\, dx m ). (5.222) 


Instead of (5.222) most of the time we write E X (F) = E [V | X(0) = x], for a 
bounded stochastic variable Y. Since X is a time homogeneous Markov process 
we see that the following equality also holds P x -almost surely: 

E x (Yod t \? t ) =E x( q(r), (5.223) 
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for all t ^ 0 and for all bounded random variables Y. The equality in (5.223) 
is first proved for random variables Y of the form Y = -i where 

the functions fj , 1 < j < m, are bounded Borel functions. Equality (5.223) is 
also true if P.,. and E x are replaced by P and E respectively. 

5.128. Remark. The expectation value E X (Y) is in fact the Radon-Nikodym 
derivative of de measure B *—» E [Y, X (0) e B] with respect to the measure B h-> 
P[X(0) eB], If in this definition we take for Y the variable Y = Iq (X(s)), 
then we obtain the probability kernel ^^ o (x,(7). Hence, these quanti¬ 
ties are defined as Radon-Nikodym derivatives. So, in general, the expression 
P^( ) | A . (0) (z,C') is not defined for every x e E. However, we will assume that 

these probability kernels exist for every x e E indeed, and that the correspond¬ 
ing semigroup is a Feller. Many authors define a (time homogeneous) Markov 
process X relative to a family of probability measures (P x : x e E) by means of 
the following equality: 

E x (Yo# t \? t ) =E m (Y), (5.224) 

P,, -almost surely for all x e E, for all t ^ 0 and for all bounded random variables 
Y : 12 —> C. In fact we also do this. In the time homogeneous case the equality 
in (5.224) also holds for stopping times T: 

E x (Yo# t \? t )=E x{t) (Y), (5.225) 

P x -almost surely on the event {T < co}, provided that the process X is cadlag. 
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The equality in (5.225) can be proved in the same manner as equality (5.209) in 
Theorem 5.126. Therefore pick / e Cq(E) and a stopping time T : 0 —> [0, go]. 
Consider the stopping times T n := 2~ n \2 n T], n e N, instead of T. Then, for an 
event A e 3~r n , we have 

Ea: [/ (Al( s + Tn)) 1 An{T n =k2~ n }\ 

= E x [/ (X( S + k2~ n )) l A n{T n =k2-n } ] 

= Ea; [E* (/ (AT(s + k2 n )) | 3 r k2~ n ) lAn{T n =fc2- n }] 

= E x [E x . (/ (X(s + k2 n )) | lnn{T n =fc2-’»}] 

= E X [E x{k2 - n) (/ (X( S ))) ^-An{T n =k2~ n }~\ 

= [Ex(T„) (/ (^( s ))) lnn{T u =fc2-"}] • (5.226) 

From (5.226) it follows that (5.225) for Y = f(X(s)) and for T n in the place of 
T. By taking the limit in (5.226) for n —*• oo the equality in (5.225) follows for 
Y = f(X(s)). Precisely as in the proof of the implication (iv) => (v) in Theorem 
5.126 the equality in (5.225) then follows for arbitrary random variables Y : 12 —> 
C, which are bounded and measurable for the a -field Too- 


8. The Doob-Meyer decomposition via Komlos theorem 

Let (Q, T, P) be a probability space, let {T f : t Y ()} be a right continuous filtra¬ 
tion in T and let {X(t) : t Y 0} be a real-valued Tj-submartingale. The Doob- 
Meyer decomposition theorem states that there exists an T r martingale { M(t ) : 
t Y 0} together with an increasing predictable adapted process {A(t) : t Y 0}, 
which is right continuous P-almost surely, such that X(t) = M(t) + A(t), t Y 0, 
provided that the process {X(t) : t Y 0} is of class (DL). The latter means that 
for every t > 0 the family {X(r) : 0 Y t Y t.T stopping time} is uniformly inte¬ 
grate. Moreover this decomposition is unique in case we assume that A(O) = 0. 
By Doob’s optional sampling theorem every martingale is automatically of class 
(DL) (see e.g. Ikeda and Watanabe [61], p.35, Ethier and Kurtz [54], p.74). An 
interesting discussion of the Doob-Meyer decomposition and (sub-)martingale 
theory can be found in Kopp [74]. For a nice account of the Doob-Meyer de¬ 
composition theorem the reader may also consult van Neerven [148]. 

We shall employ the following result of Komlos [73]. In fact it can be interpreted 
as kind of a law of large numbers. 

5.129. Theorem (Komlos). Let {f k : k e N} be a sequence in L 1 (f2,T,P) such 
that 

sup (E (|/fc|) : k e N} < go. 

Then there exists an infinite large subset A 0 of N together with a function f in 
L 1 (f2,3 r , P) such that for every infinite subset A of A 0 

lim —- \ -— V fj = /, P -almost surely. (5.227) 

n^oo\An[l,n]\^jeAn[l,n] Jj J ' U K ' 
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Examples show that this limit need not be an Id-limit. Set Q = N with the 
discrete cx-field and with P {k} = 2~ k , k e N. Let {j\ : k e M} be the sequence 
defined by /& = 2 fc e^, k e N, where {e^ : k e N} is the sequence of the unit 
vectors. Then n -1 YTj=i fj 0 pointwise, but n _1 $Xq=i fj^ = 1, n e N. 

Standard results on continuity properties of submartingales yield the existence 
of a realization (version) which is continuous from the right and possesses left 
limits P-almost surely. Henceforth we shall assume that the dy-submartingale 
{X(t) : t P 0} is continuous from the right and has left limits P-almost surely. 
We shall prove that there exists a predictable increasing process {A(t) : t ^ 0} 
together with an infinite Ao of N such that for every infinite subset A of Ao and 
every t ^ 0 the variable A(t) is given as the limit: 

Alt) = lim —- \ -77^ Aj(t), (5.228) 

v J |A n [1, n\\ A-ljeAn[l,n\ K J 

where 

TW - {e (* (^) I - X (|) } . (5.229) 

Moreover the process {X(t) — Aft) : t P 0} is an dy-martingale. The limit in 
(5.228) is a point-wise almost sure limit as well as an Id-limit. 

Again let (12, £F, P) be a probability space, let {T t : t ^ 0} be a right-continuous 
filtration in T and let \X(t) : t P 0} be right continuous submartingale of class 
(DL) which possesses almost sure left limits. We want to prove the following 
version of the Doob-Meyer decomposition theorem. 

5.130. Theorem. There exists a unique predictable right continuous increasing 
process { Aft ) : t ^ 0} with A(0) = 0 such that the process { Xft ) — Aft) : t 5= 0} 
is an Jt-martingale. 

It is perhaps useful to insert the following proposition. 

5.131. Proposition. Processes of the form Mft) + Aft), with M a martingale 
and with A an increasing process in L 1 (f2, £F, P) are of class (DL). 

Proof of Proposition 5.131. Let (X(f) = M(t ) + A(t) : t ^ 0} be the 
decomposition of the submartingale {Xft) : t ^ 0} in a martingale { Mft ) : t ^ 
0} and an increasing process {Aft) : t ^ 0} with M(0) = 0 and 0 ^ r < t be any 
dy-stopping time. Here t is some fixed time. For JVeNwe have 

E(|X(r)| : |X(r)| ^ N) < E(|M(r)| : |X(r)| ^ N) + E (A(r) : |X(r)| > N) 

< E(|M(*)| : |X(r)| > N) +E(A(r) : |X(r)| ^ N) 

^ E {\M(t)\ + Aft) : |X(r)| ^ N) 

< E [ \Mft)\ + Aft) : sup |X(s)| ^ AM . (5.230) 

\ / 
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Since N x P{sup 0sSssSt |X(s)| ^ TV} < E(|X(£)|), it follows that 
lim sup{E(|X(r)| : |X(r)| ^ N) : 0 ^ r < t, r stopping time} = 0. (5.231) 

N^CO 

This shows Proposition 5.131 □ 

Similarly we have the following result. 

5.132. Proposition. Let {X(t) : t ^ 0} be an “Jt-submartingale. For any real 
number N the process {max(X(£), N) : t 5= 0} is an J t -submartingale which is 
of class (DL). 

Next we come to the heart of the matter. The symbol [x|, x e M, denotes the 
integer k with k < x ^ k + 1. 
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Proof of Theorem 5.130. It will be convenient to introduce the follow¬ 
ing processes: 

Xj(t) = e(x (fpl) | T) , t>0, je N; (5.232) 

4<*> - 2^, { E (* (^) i **») - x (|)} • < 5 - 233 > 

The processes {Aj(t) : t > 0} are right continuous and have left limits. The 
processes {Aj(t) : t ^ 0} are predictable in the sense that, for j, N in N, the 
functions (t,oj) >-* Aj(t,u) are measurable with respect to the a-field generated 
by the collection {l( a ,6] x A : 0 ^ a < b, A e T a }: see e.g. Durrett [44], p. 49. 
Moreover it is readily verified that the process 


{Xj(t) - Aj(t) : t > 0} (5.234) 

is an fly-martingale and that 

lim E (AJt) - AJt-)) = 0. (5.235) 

j~> 00 

Equality (5.235) is true because 

limE (X(s)) = E (X(t )). (5.236) 

S^t 

Equality (5.236) can be proved in the following manner. Put 

X"{t) = limE (X(t + h) | 5 t ) = inf E (X(t + h) | 3F t ). 

ftj.0 h> 0 

Then X"{t) ^ X(t), P-almost surely. The following argument shows that 
X"(t) = X(t), P-almost surely. Define for m e N the stopping time r m by 

r m = inf{s > 0 : |X(s)| > m}. 

Then, P-almost surely, r m \ oo. Moreover, we have 


E [X'\t) - X(t) : r m > t] (5.237) 

= limE [E (X(t + h) | 3F t ) - X(t) : r m > t] 

hi 0 

= limE [E ((X(t + h) - X(t)) l iT „>„) | 7,] 

= lim E \X(t + h) — X(t) : r m > t] 

HO 

= lim {E [X(t + h) — X(t) : r m > t + h] 

hi 0 

+E [X(t + h) — X(t) : t < r m ^ t + h \} 

< hm {E [X(t + h) — X(t) : r m > t + h\ 

+E [E (X(t + 1) | 9y + /j) — X(t) '■ t < T m ^ t + h]} 

= hm {E [X(t + h) — X(t) : r m > t + h\ 

+E [X(t + 1) — X(t) : t < r m < t + h]} = 0, 
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by dominated convergence (twice: on {r m > t + h] we have | X(t + h) — X(t )| < 
2m, P-almost surely). Consequently 

0 < E (X"(t) - X(t)) = lim E (X"(t) - X(t) : r m > t) = 0, (5.238) 

771 —> 00 

and hence X"(t) = X(t), P-almost surely. We also infer 

E (X(t)) = E(X"(t)) = E flimE (X(t + h) \ 3 t )^X(t) ) +E(X(t)) 

\h[0 J 

= E flimE ((X(t + h) - X(t)) I T t )) +E (X(t)) 

\H0 J 

= lim E (E ((X(t + h)~ X(t)) | T t )) + E (X(t)) 

h[ 0 

= limE (X(t + h)~ X(t)) + E (X(t)) = limE (X(t + h )). (5.239) 

hi 0 hi 0 

This proves (5.236). In addition we write 

f(t)=E(X(t)-X(0)) (5.240) 

and we define the countable dense subset D of [0, oo) by 

D = {f^0:teQ}u{f^0: f(t +) > f(t —)} . (5.241) 

(Notice that the functions / is increasing.) 


Let A 0 be any infinite subset of N and let {A.A 0 (t) : t e D} be a process such 
that for every infinite subset A of A 0 and P-almost surely, 

(t) = lim —-1- =t V* An(t), t 6 D. (5.242) 

AoW n^oo |A n [l,n]' „i n )■> \ ) 


By Komlos’ theorem (Theorem 5.129) and a diagonal procedure such a subset 
A 0 exists. We shall prove that for t e D the limit in (5.241) also exists in L 1 - 
sense. In view of a theorem of Scheffe (Corollary 2.12.5 in Bauer [ 10 ], p. 105, 
it suffices to prove that 

E(4(f)) = lim —- 1 - -V EfAJt)). (5.243) 

V A ° W ' n -00 |A n [l,n]| ^Jj6An[l,n] V V ’ 

It is readily verified that 

= E (x ( 1 ^)) - E (AT(0)), (5.244) 

so that 

lim- - 1 - -V E (AJt)) = f(t+), teD.) (5.245) 

n^oo |An [l,n]| 4eAn[l,n] V 3\ >> J \ )i ) V ) 

On the other hand we have, by Fatou’s lemma, 

E (A\ 0 (t)) ^ liminf —-—— -r- V E (Aj(t)) = f(t+). (5.246) 

V AoW ' n^oo |An [l,n]| ^jeAn[l,n\ v 3K>> Jy ; v ’ 

In addition we have for A > 0 


E (A Ao (t)) ^ E 


1 




lim sup 7l r , 

n ^o 3 |A O [1, 97.] | ^-0'eAn[l,n 


A j(t ) 
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^ E [ lim sup —-— _. / 

n^oo IA n [1, n]| ^ieAn[l,n] 


■ 2 , 


Aj (min (f, t a ) -) , (5.247) 


where r A is the stopping time defined by 

1 


T\ = inf < s > 0 : s e D, sup 7 , r n , , 

^ neN |An[l,n]|^eAn[l,n] 


2 ., 


Aj(s)> AL (5.248) 


Since 


1 


2 * 


Aj (min (t, r A ) -) < A, 


(5.249) 


|A n [l,n]| ^- | ieAn[i,n] 
we infer from (5.247) and (5.235) that, for any A > 0, 

E (A Ao (t)) ^ lim sup 1 XL An ri „i E (4* ( min (*’”)) 
n—>co |A n [l,nj| ^^ eAn L 1 ’ ? u 

^ E (X(t) : r A > t) + E (X (min(r A , t)) : r A < t) - E (X(0)). 

(5.250) 

Since r A | oo, P-almost surely, as A tends to infinity, we infer from (5.250) 
together with the fact that the collection {X{r) : r ^ t, r stopping time} is 
uniformly integrable, 

E (A Ao (t)) >E(X(t) - X(0)) = /(f). (5.251) 
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(In fact in (5.250) we first take the sum, then we write fl = [r\ > t} u {t\ ^ t}.) 
The right continuity of the submartingale {X(t) : t A 0 } together with (5.236) 
implies the equality 

fit) = f(t+). (5.252) 

Hence the equality in (5.243) now follows from (5.251), (5.252) and (5.246). So 
the limit in (5.241) is also an Id-limit. Since the submartingale {X(t) : t A 0} 
is continuous from the right we also deduce 


limsupE (|X, (t) — X(f)|) 

j *oo 


lim sup E I X 

j *co V 

lim/(s) 

sit 


m 


\Vi\ 

2 i 

- 0 . 


E(X(t)) 

(5.253) 


Hence the L 1 -convergence in the equality 
|A n [ 1 , n]| 2 X &) 

L J jeAn[l,n] 

|A n [1, 77 .] | ^ieAn[l,n] 3 ^ ^ |A H [1, 77 ] | 


qeAn[l,n] 


A(t) 


yields 

X(t) = M\ 0 (t ) + A\ 0 (t), t e D, (5.254) 

where the process {AA 0 (t) : t £ D) is increasing and predictable. We shall extend 
(5.254) to all t A 0 and we shall prove that the process {A.A 0 (t) : t £ D) has right 
continuous extensions to all of [0, oo). In order to achieve this fix to f D, to > 0, 
and let s, t be arbitrary numbers in D with 0<s<to<t<cc. Then 

Avds) A lim inf —- ? -— V 1 AAs) 

n—HX) | A n [1, 77] | ^- 6 'eAn[l,n] 


< lim inf —- 7 -— y AAtn) 

n—>co | A O [1, 77] I -MjeAn[l,n] 3 

^ lim sup 

n—> oo 


(5.255) 


1 77-F-77 ^ A i(t) A A.A n (t). 

| A n [1,77] | An[l,n] 3 

From (5.255) it follows that 

E | lim sup 77 -p- 77 V AAto) — lim inf 7- - 7 - 77 V* AAtn) 

V n ^oo |A n [1, 77 ] | ^-b'eAn[l,n] 3 n->co | A n [1, n\ I ^-b'eAn[l,ra] 3 

A E (A Ao (t) - A Ao (s)) = E (X(t) - X(s)) = fit) - f(s). (* ™ 

So that 

1 ^ . 1 


(5.256) 


(5.257) 


E | lim sup 7 -- 7 - 77 y AAtn) — lim inf 77 - 7 - 77 y AAt 0 ) 

\ n _>oo |A O [1, 77 ] | An[l,n] 3 n->co |A O [l,n]| A-ljeAn[l,n] 3 

A f(t 0 +) ~ /(to-) = /(to) - /(to) = 0, 
since to does not belong to D. Hence, for every to A 0, 

l, A „ 1 | T . .. - AM, n) 


= lim inf 


(k 


n —>nn 
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P-almost surely. In addition, as above we also have 

E(A Ao (t)) = f(t), t> 0,) (5.259) 

and hence 

E(A Ao (f) - A Ao (s)) = f(t)-f(s). (5.260) 

So that the process {/L\ 0 (t) : t ^ 0} is almost surely right continuous. Again we 
have decomposition (5.254) for all t ^ 0. From (5.236) and (5.258) it follows 
that, P-almost surely, 

^-Ao {to) = lim —- Aj(t 0 ~) 

n—>00 |A n [l,nj| ,n\ 

and consequently the process {A Ao (t) : t ^ 0} is predictable. □ 


The uniqueness of the Doob-Meyer decomposition does not depend on the (DL)- 
property. So the processes (M Ao (f) : t ^ 0} and (A Ao (f) : t 5= 0} do not depend 
on the particular choice of A 0 . Henceforth we write 


X(t) = M(t ) + Aft), t A 0, (5.261) 

where { M(t ) : t ^ 0} is an cb-martingale and where {A(t) : t ^ 0} is an increas¬ 
ing right continuous process which is predictable. Proposition 5.131 shows that 
the process {X(t) : t ^ 0} must possess the (DL)-property. Let D 0 be the count¬ 
able dense subset of [0, oo) given by 

D 0 = {t e Q : t ^ 0} u {t ^ 0 : f(t +) > /(£—)} (5.262) 


and choose A 0 <= N, |A 0 | = oo, and the process { B(t ) : t e D 0 } in such a way 
that for every infinite subset A of A 0 , 


B(t) = lim 




n—>co |A n [l, 7 l]| AjjeKr>{l,n\ 


Aj(t)i t e D 0 . 


(5.263) 


Then, as in the case of {Aj(t) : j e N} it follows that the convergence in (5.263) 
is an L 1 -convergence as well. Again as above the convergence in (5.263) occurs 
for all t ^ 0. Consequently the process {X(t) — B(t ) : t ^ 0} is a martingale, 
because the processes {X 3 {t) — A 3 {t ) : 1 > 0}, j e N, are martingales. Here 


*;(*) 


E 


X 




These remarks prove the following corollary. 


5.133. Corollary. Write a submartingale {X(t) : t ^ 0} in the form X(t) = 
M(t ) + A(t), t ^ 0, where the process { M(t ) : t ^ 0} is a martingale and where 
{A(t) : t ^ 0} is a right continuous increasing predictable process with A(0) = 0. 
Then there exists an infinite subset A 0 of N such that for every infinite subset 
A of Ao and every t ^ 0: 

Alt) = lim — -p- rry Aj (t) . (5.264) 

n—>co |A n [l,7l]| ^ J jeAn[l,n] 
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and the convergence in (5.264) is a P-almost sure as well as an Id-convergence. 
Of course the process {A(t) : t > 0} does not depend on the particular choice of 
A 0 for which all the limits in (5.264) exist. 


Next the uniqueness part of the Doob-Meyer decomposition will follow from 
Proposition 5.134. 


5.134. Proposition. Let Z = {Z(t) : t ^ 0} be a bounded martingale and let 
A = {A(t) : t ^ 0} and {B(t) : t ^ 0} be adapted increasing processes such that 
B — A is a martingale. Also suppose that E(A(£)) < go, for t ^ 0. Then 

E [Z(t+) ( B(t+) - A(t+)) - Z{ 0) (5(0) - A( 0))] 

= E^J (Z(s+) - Z(s-))d(B - A)(s) S j . (5.265) 


5.135. Remark. The integral (Z(s+) — Z(s—)) d{B — A)(s) should be inter¬ 
preted as follows: 


\\ Z (s+)-Z(s-))d(B-A)(s) 

Jo 

rGO rCO 

- (Z(»+) - Z(s-)) 1(0,,](s)<iB(») - (Z( s +) - Z(s-)) l m ,( s )dA( s ). 

Jo Jo 


Proof of Proposition 5.134. Let n e N. Since Z is a martingale we 
have: 


E [Z ([2 n L|2 -n ) (B ([2 n L|2 -n ) - A (\2 n t]2~ n ))] - E [Z( 0) (5(0) - A( 0))] 


= E 


2 Z((j + 1)2-") 

0^j<[2 n £] 


{(B (U + 1)2-") - B 02-")) - (A (0 + 1)2-") - A (j 2"“))} 


Since B — A is a martingale, it follows that: 

E [Z (\2 n t]2~ n ) (B ([2 n L|2 -n ) - 5(0) - A ([2 n t]2“ n ) + A{ 0))] 

- E [Z(0) (B(0) - A(0))] 

= E (L„^ m ( Z (W+ 1 ) 2 “”)-Z0’2-")) 

((B ((j + 1)2-") - B (j2-")) - (A (0 + 1)2-”) - A (j2-)))). 

Put 

Z:(s) = Z ([2" S )2-") = o Z ((j + 1)2-") l fa2 -~,o +1 ) 2 -.]( S ), and 

Z-(s) = Z (([2"s| - 1) 2-") = 2“ o Z (j2-“) l 02 -~.(j + i)2-.](a). 
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Then we obtain 

E (Z (\2 n t\2~ n ) ( B (|2"t]2 -n ) - A (\2 n t]2~ n ))) - E [Z( 0) (B( 0) - ,4(0))] 

= E (J“ (ZRs) - Z-(s)) l,„, |2 ., |2 -»]( S )<i(B( S ) - A( S ))) . 

So, upon letting n tend to infinity, Proposition 5.134 follows. □ 

5.136. Proposition. In addition to the hypotheses in Proposition 5.134, sup¬ 
pose that the martingale B — A is predictable. Then B(t+ ) = A(t+) P -almost 
surely. So that, if B — A is right-continuous, then B = A P -almost surely, 
provided B( 0) = ,4(0) = 0. 


Proof. First we prove that E (Z(s+) — Z(s—)) d(B — ,4)(s)^ = 0. Here 

we shall employ the predictability of the process B — A. It suffices to prove that, 
for all s > 0, 

E ((Z(s+) - Z(s-)) ( B{s+ ) - A(s+) - B(s-) + A(s-))) = 0. (5.266) 

Since the predictable field on O x [0, go) is generated by the collection {C x (a, b] : 
C e £F a , 0 < a < b} it suffices to prove (5.266) for all s ^ 0 if B — A is of the 
form 

B(s) - A(s) = l c x l (a+e ,oo)(s), C e %. (5.267) 

So let C belong to T a and let B(s) — H(s) = 1 px l( a+e00 )(s). Then, for s = a + e 
(and C e T a ), we have by the martingale property of Z, 

E(Z((a + £)+)-Z((a + £)-)l c ) 

= E (E (Z(((a + e)+) — Z ((a + e)-) | T a ) l c ) 

= E((Z(a)-Z(a))l c ) = 0. (5.268) 

Notice that, for s = a + e, 

E ((Z(s+) - Z{s-)) ( B(s+) - A(s+) - B(s) + A(s ))) 

= E ((Z ((a + e)+) — Z ((a + e )—)) lc). 


From Proposition 5.134 it now follows that 


E (Z(t+) ( B(t +) - A(t+))) = E [Z( 0) (B( 0) - ^(0))] = 0. 


Next, fix t > 0 and define the martingale Z(s) by 


Then 


Z(s) = E 


B(t+) - A(t+) 
\B(t+) — A(t+)\ + 1 



0 = E (Z(t+) ( B(t +) - A(t+))) = E 


f \B(t+) - Mf+y^ \ 
y | i?(t+) — ,4(f+)| + 1 J 


and hence B(t+) = A(t+), P-almost surely for all t ^ 0. It also follows that 
B(t—) = A(t—), P-almost surely for all t > 0. If the process B — A is right 
continuous almost surely, we infer B(t) = A(t), t A 0, P-almost surely. This 
completes the proof of Proposition 5.136. □ 
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As a special case the following result contains the uniqueness part of the Doob- 
Meyer decomposition theorem. 

5.137. Proposition. Let A and B be increasing adapted processes. Suppose 
that B — A is a predictable right continuous martingale. Then B(t ) = A(t) + 
5(0) — A(0), P -almost surely. 

Proof. This result is an immediate consequence of Proposition 5.134 and 
Proposition 5.136. □ 

5.138. Corollary. There is only one way to write a semi-martingale Y in the 
form Y = M + A, where M is a (local) martingale and where A is a predictable 
right continuous process of finite variation locally with A(0) = 0. 

5.139. Remark. An increasing, predictable right continuous real-valued process 
{A(t) : t ^ 0}, with E (A(t)) < oo for t ^ 0, is called a Meyer process. 


It is perhaps worthwhile to isolate the following result in the existence part of 
Doob-Meyer decomposition theorem: notation is that of the proof of Theorem 


5.130. We also use A\(t) 


1 

W\ 


A n (t), for a finite subset A of N. 

ne A 
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5.140. Theorem. Let {X(t) : t ^ 0} be a right continuous submartingale of class 
(DL). For every infinite subset A 0 of N there exists an infinite subset A of A 0 
such that for every further infinite subset A' of A, the limit 

A A (t) := ^lim A A ' n [i,jV](£); exists F-almost surely, 

and does not depend on the choice of A'. Moreover, since we are dealing with 
(DL)-submartingales, limjv->oo E [|Aa(£) — AA' n [i,iV](T)|] = 0. In addition, the 
process {A A (f) : t ^ 0} is predictable and right continuous. 


Proof. Write 

Q' = {t e : I e N} = {t > 0 : E (X(t+)) > E (X(t-))} |J (Q n [0, oo)). 
Define the measure /i on Q' by 

^ (/) = 2 2^ l + E (X (tf) — X(0))' 

Let A 0 be an infinite subset of N. Komlos’ theorem, applied to the sequence 
{A n (te) : I 6 N} n6N on the measure space {N x Q,1P(N) ® T,/r®P} applies to 
the effect that there exists an infinite subset A of A 0 such that for every further 
infinite subset A' of A, A\(t) = linpv^oo Avn[i,7V](Tr) exists for I = 1, 2,... and 
does not depend on the particular choice of A'. In addition, 

lim E (| A A (ti) - A A 'n[i,jv](b’)|) = 0- 

TV—kx) 1 17 

Next let t ^ 0 be arbitrary with E(X(t)) = E(X(t—)) = E(X(t+)) and let 
A' c A 0 , A' infinitely large. For t! < t <t" , t' , t" in Q', we have 


TV— XX) 


E lim sup A A ' n [i,iV] (t) ^ El limsup A A 'n[i,jV](0 


TV—xx) 


= E (liminf A A , n[ltN] (f)) A E (X(f) - X(0 )). 

V TV—>oo / 

Similarly we have 


E ^liminf A A 'n[i,7v](t)) A E (hminf A A 'n[i,iv](V)) 

= E f lim sup A A 'n[i, 7 V] (o') > E(X(t') - X(0)). 
V TV—>oo / 

Since E (X(t+)) = E (X(t— )), it follows that the limit 

A k i(t) := lim A A , n[ 1 N] (t) 

TV— XX) 

exists P-almost surely. Consequently, the limits 

A\(t) := lim A A ' n [i, 7 v](£), t > 0, 

TV— XX) 


all exist P-almost surely and limyv^ooE (|^4 A (^) — ^4A'n[i,7V](£)|) = 0. Finally we 
shall prove that the process {A\(t) : t ^ 0} is right continuous. Fix t 0 5 s 0 and 
let t > to. Then 


E (A A (t) - A A (f 0 )) = E (X(t) - X(t 0 )) A 0. 
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Since t >—► E(A(£)) is right continuous we infer that lim t p 0 E (Aa(£) — -4a( to)) = 
0. It follows that, P-almost surely, lim t ; to A\(t) = A\(t 0 ). 

This completes the proof of Theorem 5.140. □ 

5.141. Corollary. Let X = M + A be the Doob-Meyer decomposition of a 
submartingale into a martingale and an increasing right continuous predictable 
process A. Then, for an appropriate sequence (ne : I e N) in N, 

-j N oo 

m - lim - 2 2 ( E O (0 + I - -4 (i2-”‘)) «,)(()• 

k= 1 j =0 

This limit is an P -almost sure limit as well as a limit in T, P). 

PROOF. A combination of the existence and uniqueness of the Doob-Meyer 
decomposition yields the desired result. Notice that by Proposition 5.131 a 
process of the form M + A, where M is a martingale and where A is an increasing 
adapted process in L 1 (D,T, P) is of class (DL): see (5.230) and (5.231). So the 
proof of Corollary 5.141 is complete now. □ 

Another corollary is the following one. 

5.142. Corollary. Let {X(t) : t ^ 0} be a right continuous submartingale of 
class (DL) with left limits. Fix to > 0 and let {t^ : I e N} be sequence of stopping 
times which increases to the fixed time to- Suppose Ti < to, P -almost surely, for 
all I e N. Then E(|A(t 0 —)|) < co and lim^oo E (|A” (rf) — X (i 0 —)|) = 0. In 
addition, lim^o E (|X (to + h) — X (t 0 )|) = 0. 

The following result also follows from our discussion. 

5.143. Corollary. Let {X(t) : t ^ 0} be a submartingale. If the function t >—> 
E(X(t)) is P -almost surely continuous, then the process {A(t) : t ^ 0} is P- 
almost surely continuous as well. 

5.144. Remark. Several people have reformulated and extended Komlos’ result 
as a principle of subsequences, e.g. see Chatterji [30]. Others have treated an 
infinite dimensional version, e.g. see Balder [7]. In [96], Exercise 3, p. 103 the 
authors give an example of a submartingale which is not of class (DL). In fact 
Metivier and Pellaumail give the following example. Let O be the interval [0,1] 
with Lebesgue measure and let 0 = to < D < • • • < tn < ■ ■ • < 1 be a sequence 
such that lim n ^oot n = 1. Define the process X by 

X(t,u>) = ~^ n=1 2 n lp n _ 1 ,t n )(i)l(i_2-'*,i](<^), oj e [0,1], t ^ 0. 

Then A is a submartingale, X is not of class (DL) and A is a martingale on 
the interval [0,1). If t n _i < t < t n , we write for the cr-field generated by 
{(j — l)2 -n , j2 n ] : 1 A j A 2 n }. If t ^ 1, then % is the Borel field of [0,1], 

5.145. Definition. Let {Y(t) : t 5= 0} be a martingale in L 2 (f2,T,P). Then 
{|F(t)| 2 : t ^ 0} is a submartingale of class (DL). So by Theorem 5.130 there 
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exists a unique martingale { M(t ) : t F 0} with M(0) = |F(0)| 2 and an increasing 
predictable right-continuous process {(F) (t) : t F 0} in L 1 (fl,9 r , P) such that 

|F(f)| 2 = M(t) + (F) (t), P-almost surely. 

The process {(F) (t) : t F 0} is called the (quadratic) variation or variance pro¬ 
cess of {F (t) : t F 0}. 

5.146. Example. Let { B(t ) : t F 0} be p-dimensional Brownian motion. Then 
the process {t ^ ut : t F 0} is the corresponding quadratic variation process. 

5.147. Example, let t i—> \ ) t Q Fi(s) dB(s) and t >—> F^s) dB(s) be two local 
martingales. Then the process t >-*■ y o Fi(s)F 2 (s) ds is the corresponding covari¬ 
ation process. 
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Subjects for further research and presentations 

The following topics may be of interest for a presentation and/or further re¬ 
search: 


(1) Certain pseudo-differential operators of order less than or equal to 2 
can be put into correspondence with space-homogeneous or non-space- 
homogeneous Markov processes. A detailed exposition can be found in 
Jacob [ 62 , 63 , 64 ]. 

(2) Viscosity solutions to partial differential equations. The standard ref¬ 
erence for this subject is Crandall, Ishii, and Lions [ 35 ]. This topic 
can also be treated in the context of Backward Stochastic Differential 
Equations (BSDEs): see, e.g., Pardoux [ 109 ]. 

(3) Elliptic differential operators of second order (and Markov processes); 
see, e.g., Cksendael [ 106 ], 

(4) Parabolic differential operators (of second order and Markov processes). 
An interesting article in this context is Bossy and Champagnat [ 23 ]. 
The abstract of this paper reads: “We present the main concepts of the 
theory of Markov processes: transition semigroups, Feller processes, in¬ 
finitesimal generator, Kolmogorov’s backward and forward equations, 
and Feller diffusion. We also give several classical examples includ¬ 
ing stochastic differential equations (SDEs) and backward stochastic 
differential equations (BSDEs) and describe the links between Markov 
processes and parabolic partial differential equations (PDEs). In par¬ 
ticular, we state the Feynman-Kac formula for linear PDEs and BSDEs, 
and we give some examples of the correspondence between stochastic 
control problems and Hamilton-Jacobi-Bellman (HJB) equations and 
between optimal stopping problems and variational inequalities. Sev¬ 
eral examples of financial applications are given to illustrate each of 
these results, including European options, Asian options, and Ameri¬ 
can put options.” 

(5) Solutions to stochastic differential equations and the corresponding sec¬ 
ond order differential equation (of parabolic type) satisfied by the one¬ 
dimensional distributions. 

(6) Backward stochastic differential equations and their viscosity solutions; 
see, e.g. Pardoux [ 109 ], Van Casteren [ 147 ], Boufoussi and Van Cast¬ 
eren [ 24 , 25 ], Boufoussi, Van Casteren and Mhardy [ 26 ]. 

(7) Heat equation on a Riemannian manifold. A relevant book in this 
context is [ 59 ]. For connections with stochastic differential equations 
on manifolds see, e.g., Elworthy [ 52 , 53 ]. 

(8) Oscillatory integrals and related path integrals. There is a lot of lit¬ 
erature on this subject. Nice papers on this topic are Albeverio and 
Mazzucchi [ 1 , 2 ], Interesting books are, e.g., Mazzucchi [ 95 ], Johnson 
and Lapidus [ 65 ], and Kleinert [ 69 ]. 

(9) Malliavin calculus, or stochastic calculus of variations, and applica¬ 
tions to regularity properties of integral kernels. For details see e.g. 
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Nualart [ 103 , 104 ], Other references which contain results on and ap¬ 
plications of Malliavin calculus include: Cruzeiro and Malliavin [ 36 ], 
Stroock [ 127 , 128 , 129 ], Cruzeiro and Zambrini [ 37 ], [ 38 ]. Of course 
the original work by Malliavin should not be forgotten: [ 92 ]. The book 
by Bismut [ 18 ] combines Malliavin calculus with the theory of large 
deviations. For a discussion on Malliavin calculus in relation to Levy 
processes see, e.g., Osswald [ 108 ]. A rather elementary approach to 
Malliavin calculus can be found in Friz [ 56 ]. For application to sto¬ 
chastic differential equations see, e.g., Takeuchi [ 135 ]. For applications 
of Malliavin calculus to operator semigroups see, e.g., Leandre [ 83 , 84 ], 
For Malliavin calculus without probability theory see [ 82 ], 

(10) Books and papers with literature on financial mathematics include: 
Leon, Sole, Utzet, and Vives [ 85 ], Nualart and Schoutens [ 105 ], Malli¬ 
avin and Thalmaier [ 93 ], Karatsas and Shreve [ 66 ], Gulisashvili [ 60 ], 
El Karoui and Mazliak [ 51 ], El Karoui, Pardoux and Quenez [ 49 ], Lim 
[ 87 ]. Other references include Zhang and Zhou (editors) [ 155 ] and 
Tsoi, Nualart and Yin [ 138 ]. 

(11) Another interesting subject is “Ergodic theory” and, correspondingly, 
invariant measures. We mention some references: Krengel [ 75 ], Karlin 
and Taylor [ 67 ], Meyn and Tweedie [ 97 ], Eisner and Nagel [ 48 ], Van 
Casteren [ 146 ], Seidler [ 119 ], Goldys [ 58 ], [ 115 ]. 

(12) Central limit theorems and related results are also relevant. Again 
we mention some references: Bhattaraya and Waymire [ 15 ], Nourdin 
and Peccati [ 101 ], Barbour and Chen [ 8 ], Berckmoes, Lowen and Van 
Casteren [ 11 , 12 , 13 , 14 ], Tao [ 137 ], Stein [ 124 , 125 ], Chen, Goldstein 
and Shao [ 31 ], Barbour and Hall [ 9 ], 

(13) Investigate Markov processes with a Polish space as state space: see, 
e.g., Sharpe [ 120 ], Swart and Winter [ 134 ], Van Casteren [ 146 ], Bovier 

[ 27 ]. 

(14) Discuss and make a careful study of the Skorohod space as described 
in Remark 3.40. Try to include applications to convergence properties 
of stochastic processes. 

(15) Discuss stochastic analysis in the infinite-dimensional context. A nice 
and relevant survey paper is [ 150 ] written by van Neerven, Veraar and 
Weis. A simplified version in Dutch is authored by van Neerven: see 

[ 149 ], 
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